This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 

BEST AVAILABLE IMAGES 



Defective images within this document are accurate representations of 
the original documents submitted by the appHcant. 

Defects in the images may include (but are not Umited to): 

• BLACK BORDERS 

• TEXT CUT OFF AT TOP, BOTTOM OR SIDES 

• FADED TEXT 

• ILLEGIBLE TEXT 

• SKEWED/SLANTED IMAGES 

• COLORED PHOTOS 

• BLACK OR VERY BLACK AND WHITE DARK PHOTOS 

• GRAY SCALE DOCUMENTS 

IMAGES ARE BEST AVAILABLE COPY. 



As rescanning documents mil not correct images, 
please do not report the images to the 
Image Problem Mailbox. 



4^0 



CIT009-PCT1 



PCX 



World intellectual property organization 

Intemational Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification ^ : 

C07D 207/34, 233/90, A61K 31/415, 
C07D 403/14, C12Q 1/68 



Al 



(11) International Publicaaon Number: WO 98/37066 

(43) International PubUcaUon Date: 27 August 1998 (27.08.98) 



(21) International Application Number: PCT/US98/01006 

(22) Internationa! Filing Date: " 21 January 1998 (21.01.98) 



(30) Priority Data: 

PCrnUS97/03332 
(34) Countries for 

international 
60/043,444 
60/042,022 
08/837,524 
08/853.522 
PCT/US97/12722 
(34) Countries for 

international 



20 February 1997 (20.02.97) WO 
which the regional or 

application was filed: US et al. 

8 April 1997 (08.04.97) US 

16 April 1997 (16.04.97) US 

21 April 1997 (21.04.97) US 
8 May 1997 (08.05.97) US 
21 July 1997 (21.07.97) WO 

which the regional or 

application was filed: US et al. 



(63) Related by Continuation (CON) or Continuation-in-Part 
(CIP) to Earlier Applications 

US 08/853.522 (CIP) 

Filed on 8 May 1997 (08.05.97) 

US 08/837.524 (CIP) 

Filed on 21 April 1997 (21.04.97) 

US 08/607.078 (CIP) 

Filed on 26 Februaty 1996 (26.02.96) 

US 60/042,022 (CIP) 

Filed on 16 April 1997 (16.04.97) 



US 

Filed on 



60/043,444 (CIP) 
8 April 1997 (08.04.97) 



(71) Applicant (for all designated States except US): CALIFORNIA 

INSTITUTE OF TECHNOLOGY [US/US); Pasadena, CA 
91 125 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): BAIRD, Eldon. E. 
[US/US]; 255 S. Madison Avenue #5, Pasadena. CA 91 101 
(US). DERVAN. Peter. B. [US/USl; 1235 St. Albans Road. 
San Marino, CA 91108 (US). 

(74) Agent: McDONNELL. John, J.; McDonnell Boehnen Hulbert 
& Berghoff, 300 South Wacker Drive, Chigago, IL 60606 
(US). 



(81) Designated States: AL. AM, AT, AU. AZ. BA, BB. BG. BR. 
BY, CA. CH. CN. CU. CZ. DE. DK. EE. ES. FI. GB. GE. 
GH. GW. HU. ID, IL. IS, JP. KE, KG. KP. KR, KZ, LC, 
LK, LR. LS. LT. LU. LV, MD, MG. MK, MN. MW. MX. 
NO. NZ, PL. PT, RO. RU, SD, SE. SG, SI. SK, SL. TJ, TM, 
TR, TT. UA. UG. US, UZ. VN. YU. ZW. ARIPO patent 
(GH, GM. KE. LS, MW, SD, SZ, UG, ZW). Eurasian patent 
(AM, AZ. BY, KG. KZ. MD, RU. TJ. TM). European patent 
(AT, BE. CH. DE, DK, ES. FI. FR, GB, GR, IE, IT, LU. 
MC, NL, PT, SE). OAPI patent (BF, BJ. CF. CG, CI, CM. 
GA, GN, ML, MR, NE, SN, TD, TG). 



Published 

With international search report. 



(54) Titie: IMPROVED POLYAMIDES FOR BINDING IN THE MINOR GROOVE OF DOUBLE STRANDED DNA 



(57) Abstract 

The invention encompasses improved polyamides for binding to specific nucleotide sequences in the minor groove of double 
stranded DNA. The 3-hydroxy-N-methylpyrroIc/N-mcthylpyrrole carboxamide pair specifically recognizes the T.A base pair, while tfie 
N-methylpyfrole/3-hydroxy-N-methylpyrrole pair recognizes A.T nucleotide pairs. Similarly, an N-methylimidizole/N-methylpynole 
carboxamide pair specifically recognizes the G.C nucleotide pair, and the N-mediylpyrrole/N-methylimidizole carboxamide pair recognizes 
the CG nucleotide pair. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT, 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Inland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Uixembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


SZ 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


Gil 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BR 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Builcina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


•Hirkcy 


BG 


Bulgaria 


HU 


Hungaiy 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IB 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UC 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Vict Nam 


CC 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CI[ 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


zw 


Zimbabwe 


CI 


Cftle d'lvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Foitugal 






cu 


Cuba 


KZ 


Kazokstan 


RO 


Romania 






cz 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







\VO 98/37066 



PCTAJS98/01006 



IMPROVED POLYAMIDES FOR BINDING IN THE MINOR 
GROOVE OF DOUBLE STRANDED DNA 

The U.S. Government has certain rights in this invention pursuant to Grant Nos. GM 
26453, 27681 and 47530 awarded by the National Institute of Health. 

CROSS REFERENCE TO RELATED APPLICATIONS 

This application is a continuation-in-part of PCT/US97/03332 filed February 20, 1997, 
Serial No. 08/853,522 filed May 8, 1997 and PCT/US 97/12722 filed July 21, 1997 which are 
continuation-in-part applications of Serial No. 08/837,524, filed April 21, 1997, Serial No. 
08/607,078, filed February 26, 1996, provisional application Serial No. 60/042,022, filed April 
16, 1997 and provisional application Serial No. 60/043,444, filed April 8, 1997. 

BACKGROUND OF THE INVENTION 

Field of the Invention 

This invention relates to polyamides which bind to predetermined sequences in the 
minor groove of double stranded DNA. 

Description of the Related Art 

The design of synthetic ligands that read the information stored in the DNA double helix 
has been a long standing goal of chemistry. Cell-permeable small molecules which target 
predetermined DNA sequences are useful for the regulation of gene-expression. 
Oligodeoxynucleotides that recognize the major groove of double-helical DNA via triple-helix 
formation bind to a broad range of sequences with high affinity and specificity. Although 
oligonucleotides and their analogs have been shown to interfere with gene expression, the triple 
helix approach is limited to purine tracks and suffers from poor cellular uptake. The 
development of pairing rules for minor groove binding polyamides derived from N- 
methylpyrrole (Py) and N-methylimidazole (Im) amino acids provides another code to control 
sequence specificity. An Im/Py pair distinguishes G^C from C«G and both of these from A«T 
or T«A base pairs. Wade, W.S., Mrksich, M. & Dervan, P.B. describes the design of peptides 
that bind in the minor groove of DNA at 5'-(A,T)G(A,T)C(A,T)-3' sequences by a dimeric 
side-by-side motif. J. Am, Chem. Soc, 114, 8783-8794 (1992); Mrksich, M. et al describes 
antiparallel side-by-side motif for sequence specific-recognition in the minor groove of DNA by 
the designed peptide l-methylimidazole-2-carboxamidenetropsin. Proc, Natl Acad, Sci. USA 
89, 7586-7590 (1992); Trauger, J.W., Baird, E. E. Dervan, P.B. describes the recognition of 
DNA by designed ligands at subnanomolar concentrations. Nature 382, 559-561 (1996). A 
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Py/Py pair specifies A«T fixnn G^C but does not distinguish A»T from T»A. Pelton, J.G. & 
Wemmer, D,E. describes the structural characterization of a 2-1 distamycin A- 
d(CGCAAATTTGGC) complex by two-dimensional NMR. Proc. Natl. Acad, ScL USA 86, 
5723-5727 (1989); White, S,, Baird, E. E. & Dervan, P.B. Describes the effects of the A«T/T«A 

5 degeneracy of pyrrole-imidazole polyamide recognition in the minor groove of DNA. 
Biochemistry 35, 6147-6152 (1996); White, S., Baird, E. E. & Dervan, P. B. describes the 
pairing rules for recognition in the minor groove of DNA by pyrrole-imidazole polyamides. 
CAem. & BioL 4, 569-578 (1997); White, S., Baird, E. E. & Dervan, P.B. describes the 5'-3' N- 
C orientation preference for polyamide binding in the minor groove. In order to break this 

10 degeneracy, a new aromatic amino acid, 3-hydroxy-N-methylpyrrole (Hp) incorporated into a 
polyamide and paired opposite Py, has been found to discriminate A^^ from T*A. The 
replacement of a single hydrogen atom on the pyrrole with a hydroxy group in a Hp/Py pair 
regulates affinity and specificity of a polyamide by an order of magnitude. Utilizing Hp 
together with Py and Im in polyamides to form four aromatic amino acid pairs (Im/Py, Py/Im, 

15 Hp/Py, and Py/Hp) provides a code to distinguish all four Watson-Crick base pairs in the minor 
groove of DNA. 

SUMMARY OF THE INVENTION 

20 The invention encompasses improved polyamides for binding to the minor groove of 

double stranded ("duplex") DNA. The polyamides are in the form of a hairpin comprising two 
groups of at least three consecutive carboxamide residues, the two groups covalently linked by 
an aliphatic amino acid residue, preferably y-aminobutyric acid or 2,4 diaminobutyric acid, the 
consecutive carboxamide residues of the first group pairing in an antiparallel manner with the 

25 consecutive carboxamide residues of the second group in the minor groove of double stranded 
DNA. The improvement relates to the inclusion of a binding pair of Hp/Py carboxamides in the 
polyamide to bind to a T»A base pair in the minor groove of double stranded DNA or Py/Hp 
carboxamide binding pair in the polyamide to bind to an A»T base pair in the minor groove of 
double stranded DNA. The improved polyamides have at least three consecutive carboxamide 

30 pairs for binding to at least three DNA base pairs in the minor groove of a duplex DNA 
sequence that has at least one A»T or T«A DNA base pair, the improvement comprising 
selecting a Hp/Py carboxamide pair to correspond to a T«A base pair in the minor groove or a 
Py/Hp carboxamide pair to bind to an A^^ DNA base pair in the minor groove. Preferably the 
binding of the carboxamide pairs to the DNA base pairs modulates the expression of a gene. 

35 

In one preferred embodiment, the polyamide includes at least four consecutive 
carboxamide pairs for binding to at least four base pairs in a duplex DNA sequence. In another 
preferred embodiment, the polyamide includes at least five consecutive carboxamide pairs for 
binding to at least five base pairs in a duplex DNA sequence. In yet another preferred 
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embodiment, the polyamide includes at least six consecutive carboxamide pairs for binding to at 
least six base pairs in a duplex DNA sequence. In one preferred embodiment, the improved 
poiyamides have four carboxamide binding pairs that will distinguish A«T, T*A, C«G and G«C 
base pairs in the minor groove of a duplex DNA sequence. The duplex DNA sequence can be a 
5 regulatory sequence, such as a promoter sequence or an enhancer sequence, or a gene sequence, 
such as a coding sequence or a non-coding sequence. Preferably, the duplex DNA sequence is a 
promoter sequence. 

The preparation and the use of poiyamides for binding in the minor groove of double 
10 stranded DNA are extensively described in the art. This invention is an improvement of the 
existing technology that uses 3-hydroxy-N-methylpyrrole to provide carboxamide binding pairs 
for DNA binding poiyamides. 

The invention encompasses poiyamides having y-aminobutyric acid or a substituted y- 
15 aminobutyric acid to form a hairpin with a member of each carboxamide pairing on each side of 
it. Preferably the substituted y-aminobutyric acid is a chiral substituted y-aminobutyric acid 
such as (R)-2,4-diaminobutyric acid. In addition, the poiyamides may contain an aliphatic 
amino acid residue, preferably a P-alanine residue, in place of a non-Hp carboxamide. The p- 
alanine residue is represented in formulas as p. The P-alanine residue becomes a member of a 
20 carboxamide binding pair. The invention further includes the substitution as a p»p binding pair 
for non-Hp containing binding pair. Thus, binding pairs in addition to the Hp/Py and Py/Hp are 
Im/p, p/Im, Py/p, p/Py, and p/p. 

The poiyamides of the invention can have additional moieties attached covalently to the 
25 polyamide. Preferably the additional moieties are attached as substituents at the amino terminus 
of the polyamide, the carboxy terminus of the polyamide, or at a chiral (R)-2,4-diaminobutyric 
acid residue. Suitable additional moieties include a detectable labeling group such as a dye, 
biotin or a hapten. Other suitable additional moieties are DNA reactive moieties that provide 
for sequence specific cleavage of the duplex DNA. 

30 

Brief Description of the Drawings 

Figure 1 illustrates the structure of polyamide 1^ 2a.and 3. 
Figure 2 illustrates the pairing of poiyamides to DNA base pairs. 
35 Figure 3 illustrates the DNase footprint titration of compounds 2 and 3. 

Figure 4 illustrates a list of the structures of representative Hp containing poiyamides. 
Figure 5 illustrates the synthesis of a protected Hp monomer for solid phase synthesis. 
Figure 6 illustrates the solid phase synthesis of polyamide 2. 
Figure 7 illustrates the IH-NMR characterization of polyamide 2. 
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Figure 8 illustrates the Mass spectral characterization of polyamide 2. 

Figure.9 illustrates IH-NMR characterization of synthesis purity. 

Figure 1 0 illustrates DNasel footprint titration experiment. 

Figure 1 1 illustrates the synthesis of bifunctional conjugate of polyamide 2. 
5 Figure 12 illustrates affinity cleaving evidence for oriented hairpin formation. 

Figure 13 illustrates increased sequence specificity of Hp/Py containing polyamides. 

Figure 14 illustrates 8-ring hairpin polyamides which target 5'-WGTNNW-3' sites. 

Figure 15 illustrates 8-ring hairpin polyamides which target 5'-WGANNW-3' sites. 

Figure 16 illustrates 8-ring hairpin polyamides which target 5'-WGGNNW-3* sites. 
10 Figure 17 illustrates 8-ring hairpin polyamides which target 5'-WGCNNW-3' sites. 

DETAILED DESCRIPTION OF THE INVENTION 

Within this application, unless otherwise stated, definitions of the terms and illustration 
15 of the techniques of this application may be found in any of several well-known references such 
as: Sambrook, J., et al.y Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory Press (1989); Goeddel, D., erf., Gene Expression Technology^ Methods in 
Enzymology, 185, Academic Press, San Diego, CA (1991); "Guide to Protein Purification" in 
Deutshcer, M.P., ed., Methods in Enzymology, Academic Press, San Diego, CA (1989); Innis, et 
20 al., PCR Protocols: A Guide to Methods and Applications, Academic Press, San Diego, CA 
(1990); Freshney, R.I., Culture of Animal Cells: A Manual of Basic Technique, 2^ Ed,^ Alan 
Liss, Inc. New York, NY (1987); Murray, E.J., ed,. Gene Transfer and Expression Protocols y 
pp. 109-128, The Humana Press Inc., Clifton, NJ and Lewin, B., Genes VI, Oxford University 
Press, New York (1997). 

25 

For the purposes of this application, a promoter is a regulatory sequence of DNA that is 
involved in the binding of RNA polymerase to initiate transcription of a gene. A gene is a 
segment of DNA involved in producing a peptide, polypeptide or protein, including the coding 
region, non-coding regions preceding ("leader") and following ("trailer") the coding region, as 

30 well as intervening non-coding sequences ("introns") between individual coding segments 
("exons"). Coding refers to the representation of amino acids, start and stop signals in a three 
base "triplet" code. Promoters are often upstream (" '5 to") the transcription initiation site of 
the corresponding gene. Other regulatory sequences of DNA in addition to promoters are 
known, including sequences involved with the binding of transcription factors, including 

35 response elements that are the DNA sequences bound by inducible factors. Enhancers comprise 
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yet another group of regulatory sequences of DNA that can increase the utilization of 
promoters, and can function in either orientation (5 '-3' or 3 '-5') and in any location (upstream 
or downstream) relative to the promoter. Preferably, the regulatory sequence has a positive 
activity, i.e., binding of an endogeneous ligand (e.g. a transcription factor) to the regulatory 
5 sequence increases transcription, thereby resulting in increased expression of the corresponding 
target gene. In such a case, interference with transcription by binding a polyamide to a 
regulatory sequence would reduce or abolish expression of a gene. 

The promoter may also include or be adjacent to a regulatory sequence known in the art 
10 as a silencer, A silencer sequence generally has a negative regulatory effect on expression of 
the gene. In such a case, expression of a gene may be increased directly by using a polyamide 
to prevent binding of a factor to a silencer regulatory sequence or indirectly, by using a 
polyamide to block transcription of a factor to a silencer regulatory sequence. 

15 It is to be understood that the polyamides of this invention bind to double stranded DNA 

in a sequence specific manner. The function of a segment of DNA of a given sequence, such as 
5'-TATAAA-3', depends on its position relative to other functional regions in the DNA 
sequence. In this case, if the sequence 5*-TATAAA-3' on the coding strand of DNA is 
positioned about 30 base pairs upstream of the transcription start site, the sequence forms part 

20 of the promoter region (Lewin, Genes VI pp. 831-835). On the other hand, if the sequence 5'- 
TATAAA-3' is downstream of the transcription start site in a coding region and in proper 
register with the reading frame, the sequence encodes the tyrosyl and lysyl amino acid residues 
(Lewin, Genes VI, pp. 213-215). 

25 

While not being held to one hypothesis, it is believed that the binding of the polyamides 
of this invention modulate gene expression by altering the binding of DNA binding proteins, 
such as RNA polymerase, transcription factors, TBF, TFIIIB and other proteins. The effect on 
gene expression of polyamide binding to a segment of double stranded DNA is believed to be 
30 related to the function, e.g., promoter, of that segment of DNA. 

It is to be understood by one skilled in the art that the improved polyamides of the 
present invention may bind to any of the above-described DNA sequences or any other 
sequence having a desired effect upon expression of a gene. In addition, U.S. Patent No. 
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5,578,444 describes numerous promoter targeting sequences from which base pair sequences 
for targeting an improved polyamide of the present invention may be identified. 

It is generally understood by those skilled in the art that the basic stmcture of DNA in a 
5 living cell includes both major and a minor groove. For the purposes of describing the present 
invention, the minor groove is the narrow groove of DNA as illustrated in conunon molecular 
biology references such as Lewin, B., Genes VI, Oxford University Press, New York (1997). 

To affect gene expression in a cell, which may include causing an increase or a decrease 
10 in gene expression, a effective quantity of one or more polyamide is contacted with the cell and 
internalized by the cell. The cell may be contacted in vivo or in vitro. Effective extracellular 
concentrations of polyamides that can modulate gene expression range from about 10 
nanomolar to about 1 micromolar. Gottesfeld, J.M., et aL, Nature 387 202-205 (1997). To 
determine effective amounts and concentrations of polyamides in vitro, a suitable number of 
15 cells is plated on tissue culture plates and various quantities of one or more polyamide are 
added to separate wells. Gene expression following exposure to a polyamide can be monitored 
in the cells or medium by detecting the amoimt of the protein gene product present as 
determined by various techniques utilizing specific antibodies, including ELISA and western 
blot. Alternatively, gene expression following exposure to a polyamide can be monitored by 
20 * detecting the amount of messenger RNA present as determined by various techniques, including 
northern blot and RT-PCR. 

Similarly, to determine effective amounts and concentrations of polyamides for in vivo 
administration, a sample of body tissue or fluid, such as plasma, blood, urine, cerebrospinal 

25 fluid, saliva, or biopsy of skin, muscle, liver, brain or other appropriate tissue source is 
analyzed. Gene expression following exposure to a polyamide can be monitored by detecting 
the amount of the protein gene product present as determined by various techniques utilizing 
specific antibodies, including ELISA and western blot. Alternatively, gene expression 
following exposure to a polyamide can be monitored by the detecting the amount of messenger 

30 RNA present as determined by various techniques, including northern blot and RT-PCR. 

The polyamides of this invention may be formulated into diagnostic and therapeutic 
compositions for in vivo or in vitro use. Representative methods of formulation may be found. 
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in Remington: The Science and Practice of Pharmacy, 19th ed.. Mack Publishing Co.. Easton, 
PA (1995). 

For in vivo use, the polyamides may be incorporated into a physiologically acceptable 
5 pharmaceutical composition that is administered to a patient in need of treatment or an animal 
for medical or research purposes. The polyamide composition comprises pharmaceutically 
acceptable carriers, excipients, adjuvants, stabilizers, and vehicles. The composition may be in 
solid, liquid, gel, or aerosol form. The polyamide composition of the present invention may be 
administered in various dosage forms orally, parentally, by inhalation spray, rectally, or 
10 topically. The term parenteral as used herein includes, subcutaneous, intravenous, 
intramuscular, intrastemal, infusion techniques or intraperitoneally. 

The selection of the precise concentration, composition, and delivery regimen is 
influenced by, inter alia, the specific pharmacological properties of the particular selected 
15 compound, the intended use, the nature and severity of the condition being treated or diagnosed, 
the age, weight, gender, physical condition and mental acuity of the intended recipient as well 
as the route of administration. Such considerations are within the purview of the skilled artisan. 
Thus, the dosage regimen may vary widely, but can be determined routinely using standard 
methods. 

20 

Polyamides of the present invention are also useful for detecting the presence of double 
stranded DNA of a specific sequence for diagnostic or preparative purposes. The sample 
containing the double stranded DNA can be contacted by polyamide linked to a solid substrate, 
thereby isolating DNA comprising a desired sequence. Alternatively, polyamides linked to a 
25 suitable detectable marker, such as biotin, a hapten, a radioisotope or a dye molecule, can be 
contacted by a sample containing double stranded DNA. 

The design of bifunctional sequence specific DNA binding molecules requires the 
integration of two separate entities: recognition and functional activity. Polyamides that 
30 specifically bind with subnanomolar affinity to the minor groove of a predetermined sequence 
of double stranded DNA are linked to a functional molecule, providing the corresponding 
bifunctional conjugates useful in molecular biology, genomic sequencing, and human medicine. 
Polyamides of this invention can be conjugated to a variety of functional molecules, which can 
be independently chosen from but is not limited to arylboronic acids, biotins, polyhistidines 
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comprised from about 2 to 8 amino acids, haptens to which an antibody binds, solid phase 
supports, oligodeoxynucleotides, N-ethylnitrosourea, fluorescein, bromoacetamide, 
iodoacetamide, DL-a-lipoic acid, acridine, captothesin, pyrene, mitomycin, texas red, 
anthracene, anthrinilic acid, avidin, DAPI, isosulfan blue, malachite green, psoralen, ethyl red, 

5 4-(psoraen-8-ylpxy)-butyrate, tartaric acid, (+)-a-tocopheraI, psoralen, EDTA, methidium, 
acridine, Ni(II)»Gly-Gly-His, TO, Dansyl, pyrene, N-bromoacetamide, and gold particles. Such 
bifiinctional polyamides are useful for DNA affinity capture, covalent DNA modification, 
oxidative DNA cleavage, DNA photocleavage. Such bifimctional polyamides are usefiil for 
DNA detection by providing a polyamide linked to a detectable label. Detailed instructions for 

10 synthesis of such bifimctional polyamides can be found in copending U.S. provisional 
application 60/043,444, the teachings of which are incorporated by reference. 

DNA complexed to a labeled polyamide can then be detennined using the appropriate 
detection system as is well known to one skilled in the art. For example, DNA associated with 
15 a polyamide linked to biotin can be detected by a streptavidin / alkaline phosphatase system. 

The present invention also describes a diagnostic system, preferably in kit form, for 
assaying for the presence of the double stranded DNA sequence bound by the polyamide of this 
invention in a body sample, such brain tissue, cell suspensions or tissue sections, or body fluid 
20 samples such as CSF, blood, plasma or serum, where it is desirable to detect the presence, and 
preferably the amount, of the double stranded DNA sequence bound by the polyamide in the 
sample according to the diagnostic methods described herein. 

The diagnostic system includes, in an amount sufficient to perform at least one 
25 assay, a specific polyamide as a separately packaged reagent. Instructions for use of the 
packaged reagent(s) are also typically included. As used herein, the term "package" refers 
to a solid matrix or material such as glass, plastic (e.g., polyethylene, polypropylene or 
polycarbonate), paper, foil and the like capable of holding within fixed limits a polyamide of 
the present invention. Thus, for example, a package can be a glass vial used to contain 
30 milligram quantities of a contemplated polyamide or it can be a microliter plate well to which 
microgram quantities of a contemplated polypamide have been operatively affixed, i.e., linked 
so as to be capable of being bound by the target DNA sequence. "Instructions for use" typically 
include a tangible expression describing the reagent concentration or at least one assay method 
parameter such as the relative amounts of reagent and sample to be admixed, maintenance time 
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periods for reagent or sample admixtures, temperature, buffer conditions and the like. A 
diagnostic system of the present invention preferably also includes a detectable label and a 
detecting or indicating means capable of signaling the binding of the contemplated polyamide 
of the present invention to the target DNA sequence. As noted above, numerous detectable 
5 labels, such as biotin,' and detecting or indicating means, such as enzyme-linked (direct or 
indirect) streptavidin, are well known in the art. 

Figure 1 shows representative structures of polyamides. ImlmPyPy-y-ImPyPyPy-P-Dp 
N (1), ImlmPyPy-y-lmHpPyPy-P-Dp (2), and ImlmHpPy.y-ImPyPyPy-P-Dp (3). (Hp = 3- 

10 hydroxy-N-methylpyrrole, Im = N-methylimidazole, Py = N-methylpyrrole, p == P-alanine, y = 
y-aminobutyric acid. Dp = Dimethylaminopropylamide). Polyamides were synthesized by solid 
phase methods using Boc-proiected 3-methoxypyrrole, imidazole, and pyrrole aromatic amino 
acids, cleaved from the support by aminolysis, deprotected with sodium thiophenoxide, and 
purified by reversed phase HPLC, Baird, E. E. & Dervan, P. B. describes the solid phase 

15 synthesis of polyamides containing imidazole and pyrrole amino acids. 7. Am, Chem, Soc. 118, 
6141-6146 (1996); also see PCT US 97/003332. The identity and purity of the polyamides 
were verified by *H NMR, analytical HPLC, and matrix-assisted laser-desorption ionization 
time-of-flight mass spectrometry (MALDI-TOF MS-monoisotopic): 1 1223,6 (1223.6 
calculated), 2 1239.6 (1239.6 calculated); 3 1239.6 (1239,6 calculated). 

20 

Figure 2 illustrates binding models for polyamides 1-3 in complex with 5'-TGGTCA-3' 
and 5^TGGACA-3' (A#T and T#A in fourth position highlighted). Filled and unfilled circles 
represent imidazole and pyrrole rings respectively; circles containing an H represent 3- 
hydroxypyrrole, the curved line connecting the polyamide subunits represents y-aminobutyric 
25 acid, the* diamond represents p-alanine, and the + represents the positively charged 
dimethylaminopropylamide tail group. 

Figure 3 shows quantitative DNase I footprint titration experiments with polyamides 2 
and 3 on the 3' ^^P labeled 250-bp pJK6 EcoRl/Pvull restriction fragment. Lane 1, intact DNA; 
30 lanes 2-1 1 DNase I digestion products in the presence of 100, 50, 20, 10, 5, 2, 1, 0.5, 0.2. 0.1 
nM polyamide, respectively; lane 12, DNase I digestion products in the absence of polyamide; 
lane 13, adenine-specific chemical sequencing. Iverson, B. L. & Dervan, P. B. describes an 
adenine-specific DNA chemical sequencing reaction. Methods EnzymoL 15, 7823-7830 (1987). 
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All reactions were done in a total volume of 400 jiL. A polyamide stock solution or H2O was 
added to an assay buffer containing radiolabeled restriction fragment, with the final solution 
conditions of 10 mM Tris-HCl, 10 mM KCl, 10 mM MgClj, 5 mM CaClz, pH 7.0. Solutions 
were allowed to equilibrate for 4-12 h at 22 °C before initiation of footprinting reactions. 
5 Footprinting reactions," separation of cleavage products, and data analysis were carried out as 
described. White, S., Baird, E. E. & Dervan, P, B. Effects of the A»T/T»A degeneracy of 
pyrrole-imidazole polyamide recognition in the minor groove of DNA. Biochemistry 35, 6147- 
6152(1996). 

10 Figure 4 shows the structure and equilibrium dissociation constant for numerous 

compounds of the present invention. Polyamides are shown in complex with their respective 
match site. Filled and unfilled circles represent imidazole (Im) and pyrrole (Py) rings, 
respectively; circles containing an H represent 3-hydroxypyrrole (Hp), the curved line 
connecting the polyamide subunits represents 7-aminobutyric acid (y), the diamond represents 

15 p-alanine (p), and the + represents the positively charged dimethylaminopropylamide tail group 
(Dp). The equilibrium dissociation constants are the average values obtained from three DNase 
I footprint titration experiments. The standard deviation for each set is less than 15% of the 
reported number. Assays were carried out in the presence of 10 mM Tris*HCl, 10 mM KCl, 10 
mM MgCl2, and 5 mM CaCb at pH 7.0 and 22T. 

20 

Figure 5 shows the synthetic scheme for 3-0-methyl-N-Boc protected pyrrole-2- 
carboxylate. The hydroxypyrrole monoester can be prepared in 0.5 kg quantity using published 
procedures on enlarged scale. 

25 Figure 6 shows the solid phase synthetic scheme for ImlmPyPy-y-ImHpPyPy-p-Dp 

starting from commercially available Boc-p-Pam-Resin: (i) 80% TFA/DCM, 0.4 M PhSH; (ii) 
Boc-Py-OBt, DIEA, DMF; (iii) 80% TFA/DCM, 0.4 M PhSH; (iv) Boc-Py-OBt, DIEA, DMF; 
(v) 80% TFA/DCM, 0.4 M PhSH; (vi) Boc-3-OMe-Py-OH, HBTU, DMF, DIEA; (vii) 80% 
TFA>^CM, 0.4 M PhSH; (viii) Boc-Im-OH, DCC. HOBt; (ix) 80% TFA/DCM, 0.4 M PhSH; 

30 (x) Boc-y-aminobutyric acid, DIEA, DMF; (xi) 80% TFA/DCM, 0.4 M PhSH; (xii) Boc-Py- 
OBt, DIEA, DMF; (xiii) 80% TFA/DCM, 0.4 M PhSH; (xiv) Boc-Py-OBt, DMF, DIEA; (xv) 
80% TFA/DCM, 0.4 M PhSH; (vxi) Boc-Im-OH, DCC. HOBt (xvii) 80% TFA/DCM, 0.4 M 
PhSH; (xviii) imida2ole-2-carboxylic acid, HBTU, DIEA; (xviv) dimethylaminopropylamine, 
55 ''C, 18h. Purification by reversed phase HPLC provides ImlmPyPy-y-ImOpPyPy-p-Dp. (Op 

35 = 3-methoxypyrrole). Treatment of the 3-methyoxypyrrole polyamide with thiophenol, NaH, 
DMF, at 100 °C for 120 min provides polyamide 2 after reverse phase HPLC purification. 
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Figure 7 shows the aromatic region from 7-11 ppm for the IH-NMR spectrum 
determined at 300 MHz for ImlmPyPy-y-ImOpPyPy-p-Dp and ImlmPyPy-y-ImHpPyPy-p-Dp. 
This region of the spectrum may be used to determine compoimd identity and purity. 

Figure 8 shows-the MALDI-TOF mass spectrum determined in positive ion mode with a 
monoisotopic detector for the polyamides for ImlmPyPy-y-ImOpPyPy-P-Dp and ImlmPyPy-y- 
ImHpPyPy-P-Dp. This spectrum may be used to determine compound identity and purity. 

Figure 9 shows the methyl group region from 3.5-4.0 ppm for the IH-NMR spectrum 
determined at 300 MHz for ImPyPy-y-OpPyPy-p-Dp and ImPyPy-y-HpPyPy-P-Dp, This region 
of the spectrum may be used to directly follow the progress for conversion of 3-methoxypyrrole 
to 3-hydroxypyrrole. 

Fig. 10 shows quantitative DNase I footprint titration experiments with the polyamides 
ImPyPy-y-PyHpPy-P-Dp and ImHpPy-y-PyPyPy-p-Dp on the 3^-"P labeled 370-bp pDEHl 
EcoRUPvuU restriction fragment. Intact lane, labeled restriction fragment no polyamide or 
DNase I added; lanes 1-10, DNase I digestion products in the presence of 10 ^M, 5 ^M, 2 [iM, 
1 fiM, 500 nM, 200 nM, 100 nM, 50 nM, 20 nM, 10 nM ImPyPy-y-PyPyPy-p-Dp, respectively 
or 1 ^M, 500 nM, 200 nM, 100 nM, 50 nM, 20 nM, 10 nM, 5 nM, 2 nM, 1 nM ImHpPy-y- 
PyPyPy-P-Dp, respectively; DNase I lane, DNase I digestion products in the absence of 
polyamide; A lane, adenine-specific chemical sequencing. Iverson, B. L. & Dervan, P. B. 
describes an adenine-specific DNA chemical sequencing reaction. Methods EnzymoL 15, 7823- 
7830 (1987). All reactions were done in a total volume of 40 nL. A polyamide stock solution 
or H2O was added to an assay buffer containing radiolabeled restriction fragment, with the final 
solution conditions of 10 mM Tris-HCL 10 mM KCl, 10 mM MgCh, 5 mM CaCh. pH 7.0. 
Solutions were allowed to equilibrate for 4-12 h at 22 before initiation of footprinting 
reactions. Footprinting reactions, separation of cleavage products, and data analysis were 
carried out as described. White, S., Baird, E. E. & Dervan, P. describe the pairing rules for 
recognition in the minor groove of DNA by pyrrole-imidazole polyamides. Chemistry & 
Biology 4,569-578(1997). 

Figure 1 1 shows the synthesis of a bifimctional polyamide which incorporates the Hp/Py 
pair. Treatment of a sample of ImlmPyPy-y-ImHpPyPy-P-Pam-resin (see Figure 6) with 3,3'- 
diamino-A^-methyldipropylamine, 55°C, 18 h followed by reverse phase HPLC purification 
provides the Op polyamide with a free primary amine group which can be coupled to an 
activated carboxylic acid derivative. Treatment with (i) EDTA-dianhydride, DMSO/NMP, 
DIEA, 55 **C; (ii) O.IM NaOH, followed by reverse phase HPLC purification provides the Op- 
Py-Im-polyamide-EDTA conjugate. Treatment of the 3-methyoxypyrrole polyamide with 
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thiophenol, NaH, DMF, at 100 **C for 120 min provides polyamide 2 after reverse phase HPLC 
purification. 

Figure 12 shows the determination of the binding orientation of hairpin polyamides 

5 IniImPyPy-Y-IniHpPyPy-p-Dp-EDTA*Fe(U) 2-E^Feai) and ImlmHpPy-y-ImPyPyPy-P-Dp- 
EDTA»Fe(II) 3-E»Fe(II) by affinity cleaving footprint titration. Top and bottom left: Affinity 
cleavage experiments on a 3' "P labeled 250-bp pJK6 EcoRU Pvu II restriction fi-agment. The 
5'-TGGACA-3' and 5'-TGGTCA-3' sites are shown on the right side of the autoradiogram. 
Top left: lane 1, adenine-specific chemical sequencing reaction; lanes 2-6, 6.5 nM, 1.0 fiM. 

10 100 nM, 10 nM, 1 nM polyamide 2-E*Fe(II); lane 7, intact restriction fragment, no polyamide 
added. Bottom left: lane 1, A reaction; lanes 2-6, 8.5 ^M, 1.0 jiM. 100 nM, 10 nM, 1 nM 
polyamide 3-E»Fe(II); lane 7, intact DNA. All reactions were carried out in a total volume of 
40 ^L. A stock solution of polyamide or H2O was added to a solution containing 20 kcpm 
labeled restriction fragment, affording final solution conditions of 25 mM Tris-Acetate, 20 mM 

15 NaCl, 100 fiM/ bp calf thymus DNA, at pH 7.0. Solutions were allowed to equilibrate for a 
minimum of 4 h at 22°K before initiation of reactions. Affinity cleavage reactions were carried 
out as described White, S., Baird, E.E. & Dervan, P.B. Effects of the A»T/T«A degeneracy of 
pyrrole-imidazole polyamide recognition in the minor groove of DNA. Biochemistry 35, 6147- 
6152 (1996). Top and bottom right: Affinity cleavage patterns of 2-E»Fe(II) and 3-E»Fe(II) at 

20 100 nM bound to 5'-TGGACA-3' and 5'-TGGTCA-3\ Bar heights are proportional to the 
relative cleavage intensities at each base pair. Shaded and nonshaded circles denote imidazole 
and pyrrole carboxamides, respectively. Nonshaded diamonds represent the P-alanine moiety. 
A curved line represents the y-aminobutyric acid, and the + represents the positively charged 
dimethylaminopropylamide tail group. The boxed Fe denotes the EDTA-Fe(II) cleavage 

25 moiety. 

Figure 13 shows quantitative DNase I footprint titration experiments with the 
polyamides ImPyPyPyPy-y-ImPyPyPyPy-P-Dp and ImRpPyPyPy-y-IrnHpFyPyPy-P-Dp on the 
3' ^^P labeled 252-bp pJK7 EcdRlI Pvii II restriction fragment. For ImPyPyPyPy-y- 

30 ImPyPyPyPy-p-Dp gel (left): lane 1, DNase I digestion products in the absence of polyamide; 
lanes 2-18, DNase I digestion products in the presence of 1.0 ^M, 500 nM, 200, 100, 65, 40, 25, 
15. 10, 6.5, 4.0, 2.5, 1.5, 1.0, 0.5, 0.2. 0.1 nM polyamide; lane 19, DNase I digestion products in 
the absence of polyamide; lane 20, intact restriction fragment; lane 21, guanine-specific 
chemical sequencing reaction; lane 22, adenine-specific chemical sequencing reaction. For 

35 ImHpPyPyPy-y-ImHpPyPyPy-p-Dp gel (right): lane 1, intact DNA; lane 2, DNase I digestion 
products in the absence of polyamide; lanes 3-19, 1.0 ^M. 500 nM, 200, 100, 50, 20, 10. 5, 2, 1, 
0.5, 0.2. O.l, 0.05, 0.01, 0.005, 0.001 nM polyamide; lane 20, DNase I digestion products in the 
absence of polyamide; lane 21, A reaction. All reactions were done in a total volume of 400 
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^L. A polyamide stock solution or H2O was added to an assay buffer containing radiolabeled 
restriction fragment, with the final solution conditions of 10 mM Tris-HCl, 10 mM KCl, 10 mM 
MgCl2, 5 nxM CaCl2, pH 7.0. Solutions were allowed to equilibrate for 4-12 h at 22**C before 
initiation of footprinting reactions. Footprinting reactions, separation of cleavage products, and 
5 data analysis were carried as described. White, S., Baird, E.E. & Dervan, P.B. Effects of the 
A*T/T*A degeneracy of pyrrole-imidazole polyamide recognition in the minor groove of DNA. 
Biochemistry 35, 6147-6152 (1996). 

Fig. 14 shows the 8-ring Hp-Py-Im-polyamide hairpins described by the pairing code of 
10 the present invention. The eight ring hairpin template is shown at the top. A polyamide having 
the formula X1X2X3X4-7-X5X6X7X8 wherein y is the -NH-CH2-CH2-CH2-CONH- hairpin 
linkage derived from y-aminobutyric acid or a chiral hairpin linkage derived from R-2,4- 
diaminobutyric acid; X4/X5, X3/X6, X2/X7, and Xi/Xg represent carboxamide binding pairs 
which bind the DNA base pairs. The minor groove sequence to be bound is represented as 5'- 
15 WGTNNW-3', where the 5'-GTNN-3' core sequence is defined as position a, b, c, and d (W = 
A or T, N = A, G, C, or T). A linear sequence of aromatic amino acids fills the hairpin template 
in order to satisfy the ring pairing requirements to correspond to the DNA base pairs in the 
minor groove to be bound. The ring pairing code as applied is listed in Table 2. The 16 unique 
hairpin polyamides which target 16 5'-WGTNNW-3' sequences are drawn as binding models 
20 where filled and unfilled circles represent imidazole and pyrrole rings respectively; circles 
containing an H represent 3-hydroxypyrrole, and the curved line connecting the polyamide 
subunits represents y-aminobutyric acid. 

Fig. 15 shows the 8-ring Hp-Py-Im-polyamide hairpins described by the pairing code of 
25 the present invention. The eight ring hairpin template is shown at the top. A polyamide having 
the formula XiXjXsX^-y-XsXaXyXg wherein y is the -NH-CH2-CH2-CH2-CONH- hairpin 
linkage derived from y-aminobutyric acid or a chiral hairpin linkage derived Scorn R-2,4- 
diaminobutyric acid; X4/X5, Xj/Xg, X2/X7, and Xi/Xg represent carboxamide binding pairs 
which bind the DNA base pairs. The minor groove sequence to be bound is represented as 5'- 
30 WGANNW-3\ where the 5'-GANN-3' core sequence is defined as position a, b, c, and d (W = 
A or T, N = A, G, C, or T). A linear sequence of aromatic amino acids fills the hairpin template 
in order to satisfy the ring pairing requirements to correspond to the DNA base pairs in the 
minor groove to be botmd. The ring pairing code as applied is listed in Table 2. The 16 unique 
hairpin polyamides which target 16 5'-WGANNW-3' sequences are drawn as binding models 
35 where filled and unfilled circles represent imidazole and pyrrole rings respectively; circles 
containing an H represent 3-hydroxypyrrole, and the curved line connecting the polyamide 
subunits represents y-aminobutyric acid. 
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Fig. 16 shows the 8-ring Hp-Py-Im-polyamide hairpins described by the pairing code of 
the present invention. The eight ring hairpin template is shown at the top. A polyamide having 
the formula XiX2X3X4-y-X5X6X7X8 wherein y is the -NH-CH2-CH2-CH2-CONH- hairpin 
linkage derived from y-aminobutyric acid or a chiral hairpin linkage derived from R-2,4- 

5 diaminobutyric acid; X4/X5, Xj/X^, X2/X7, and X|/Xg represent carboxamide binding pairs 
which bind the DNA base pairs. The minor groove sequence to be bound is represented as 5'- 
WGGNNW-3', where the 5'-GGNN-3' core sequence is defined as position a, b, c, and d (W = 
A or T, N = A, G, C, or T). A linear sequence of aromatic amino acids fills the hairpin template 
in order to satisfy the ring pairing requirements to correspond to the DNA base pairs in the 

10 minor groove to be bound. The ring pairing code as applied is listed in Table 2. The 16 unique 
hairpin polyamides which target 16 5'-WGGNNW-3' sequences are drawn as binding models 
where filled and unfilled circles represent imidazole and pyrrole rings respectively; circles 
containing an H represent 3-hydroxypyrrole, and the curved line connecting the polyamide 
subunits represents y-aminobutyric acid. 

15 

Fig. 1 7 shows the 8-ring Hp-Py-Im-polyamide hairpins described by the pairing code of 
the present invention. The eight ring hairpin template is shown at the top. A polyamide having 
the formula XiX2X3X4-y-X5X6X7X8 wherein y is the -NH-CH2-CH2-CH2-CONH- hairpin 
linkage derived from y-aminobutyric acid or a chiral hairpin linkage derived from R-2,4- 

20 diaminobutyric acid; X4/X5, X3/X6, X2/X7, and Xj/Xg represent carboxamide binding pairs 
which bind the DNA base pairs. The minor groove sequence to be bound is represented as 5'- 
WGCNNW-3', where the 5'-GCNN-3' core sequence is defined as position a, b, c, and d (W = 
A or T, N = A, G, C, or T). A linear sequence of aromatic amino acids fills the hairpin template 
in order to satisfy the ring pairing requirements to conespond to the DNA base pairs in the 

25 minor groove to be bound. The ring pairing code as applied is listed in Table 2. The 16 unique 
hairpin polyamides which target 16 5'-WGCNNW-3' sequences are drawn as binding models 
where filled and unfilled circles represent imidazole and pyrrole rings respectively; circles 
containing an H represent 3-hydroxypyrrole, and the curved line connecting the polyamide 
subunits represents y-aminobutyric acid. 

30 

Four-ring polyamide subunits, covalently coupled to form eight-ring hairpin structures, 
bind specifically to 6-bp target, sequences at subnanomolar concentrations. Trauger, J.W., 
Baird, E. E. & Dervan, P.B. describe the recognition of DNA by designed ligands at 
subnanomolar concentrations. Nature 382, 559-561 (1996); Swalley, S. E., Baird, E. E. & 
35 Dervan, P. B. describe the discrimination of 5'-GGGG-3', 5*-GCGC-3', and 5'-GGCC'3' 
sequences in the minor groove of DNA by eight-ring hairpin polyamides. J, Am. Chem. Soc. 
119» 6953-6961 (1997). The DNA-binding affinities of three eight-ring hairpin polyamides 
shown in Figure 1 as compound 1, 2, and 3 containing pairings of Im/Py, Py/Im opposite G»C, 
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C«G and either Py/Py, Hp/Py, or Py/Hp at a common single point opposite T«A and A«T has 
been determined. Equilibrium dissociation constants (KJ) for ImlmPyPy-y-ImPyPyPy-P-Dp 1, 
ImlmPyPy-y-ImHpPyPy-p-Dp 2, ImImHpPy-y-ImPyPyPy-p-E>p 3 of Figure 1 are shown in 
Table 1. Brenowitz, M., Senear, D. F., Shea, M. A. & Ackers, G. K. describe a quantitative 
5 DNase footprint titration method for studying protein-DNA interactions. Methods EnzymoL 
130, 132-181 (1986); The values were determined by quantitative DNase I footprint 
titration experiments: on a 3* ^^P-labeled 250-bp DNA fragment containing the target sites, 5'- 
TGGACA-3* and 5'-TGGICA-3' which differ by a single A^T base pair in the fourth position. 
The DNase footprint gels are shown in Figure 3. 

10 

TABLE 1 Equilibrium dissociation constants* 



Polyamidet 5'-TGGTCA-3' 5'-TGGACA-3' ^rtl* 
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*The reported dissociation constants are the average values obtained from three 
DNase I footprint titration experiments. The standard deviation for each data set is 
less than 15% of the reported number. Assays were carried out in the presence of 10 
mM Tris«HCl 10 mM KCl, 10 mM MgClj, and 5 mM CaClj at pH 7.0 and 22 "C. 
tRing pairing opposite T»A and A»T in the fourth position. 
^Calculated as Kd(5'-TGGACA-3') / K^i(5'-TGGTC A-3'). 

Based on the pairing rules for polyamide-DNA complexes both of these sequences are a 
match for control polyamide 1 which places a Py/Py pairing opposite 

15 A»T and T»A at both sites. It was determined that in polyamide 1 (Py/Py) binds to 5'- 
TGGICA-3' and 5'-TGGACA-3' within a factor of 2 (K^ = 0.077 or 0.15 nM respectively). In 
contrast, polyamide 2 (Py/Hp) binds to 5'-TGGICA-3' and 5'-TGGACA-3' with dissociation 
constants which differ by a factor of 18 (K^ = 15 nM and 0.83 nM respectively). By reversing 
the pairing in polyamide 3 (Hp/Py) the dissociation constants differ again in the opposite 

20 direction by a factor of 77 (K^ = 0.48 nM and 37 nM respectively. Control experiments 
performed on separate DNA fragments; reveal that neither a 5'-TGGGCA-3' or a 5'-TGG£CA- 
3' site is bound by polyamide 2 or 3 at concentrations < 100 nM, indicating that the Hp/Py and 
Py/Hp ring pairings do not bind opposite G»C or C«G. The A»T vs. T*A discrimination is 
achieved preferably when the two neighboring base pairs are G^C and C»G (GTC vs. GAC). 

25 
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The specificity of polyamides 2 and 3 for sites which differ by a single A^^/^^A base 
pair results from small chemical changes. Replacing the Py/Py pair in 1 with a Py/Hp pairing 
as in 2, a single substitution of C3-0H for C3-H, destabilizes interaction with 5'-TGGICA-3' 
by 191-fold, a free energy difference of 3.1 kcal mol'*. Interaction of 2 with 5'-TGGACA-3' is 
5 destabilized only 6-foki relative to 1, a free energy difference of 1.1 kcal mof*. Similarly, 
replacing the Py/Py pair in 1 with Hp/Py as in 3 destabilizes interaction with 5'-TGGACA-3' 
by 252-fold, a free energy difference of 3.2 kcal moV\ Interaction of 3 with 5'TGGICA-3' is 
destabilized only 6- fold relative to 1, a free energy difference of 1.0 kcal mof*. 

The polyamides of this invention provide for coded targeting of predetennined DNA 
sequences with affinity and specificity comparable to sequence-specific DNA binding proteins. 
Hp, Im, and Py polyamides complete the minor groove recognition code using three aromatic 
amino acids which combine to form four ring pairings (Im/Py, Py/Im, Hp/Py, and Py/Hp) which 
complement the four Watson-Crick base pairs, as shown in TABLE 2. There are a possible 240 
four base pair sequences which contain at least 1 A»T or T»A base pair and therefore can 
advantageously use an Hp/Py, or Py/Hp carboxamide binding. Polyamides binding to any of 
these sequences can be designed in accordance with the code of TABLE 2. 



10 



15 



TABLE 2 Pairing code for minor groove recognition* 
Pair G'C C^G T»A A»T 

Im/Py + - - - 

Py/Im - 4- • - 

Hp/Py - - + - 

Py/Hp - - - + 



* favored (+), disfavored (-) 

20 
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For certain G^C rich sequences the affinity of polyamide-DNA complexes may be 
enhanced by substitution of an Im/p pair for Im/Py at G»C and p/Im for Py/Im at C«G. At A*T 
and T-A base pairs, either a Py/p, p/Py, and p/p may be used. The alternate aliphatic/aromatic 
amino acid pairing code is described in Table 3. 



TABLE 3 Aliphatic/ Aromatic substitoition for ring 


pairings* 




Pair 


Substitution 


Im/Py 


Im/p 


Py/Im 


p/lm 


Hp/Py 


Py/p.p/Py,Hp/p.p/p 


Py/Hp 


Py/p, p/Py, p/Hp,p/p 



U. S. Patent 5,578,444 describes numerous promoter region targeting sequences from 
which base pair sequences for targeting a polyamide can be identified. 

PCT U.S. 97/003332 describes methods for synthesis of polyamides which are suitable 
for preparing polyamides of this invention. The use of P-alanine in place of a pyrrole amino 
acid in the synthetic methods provides aromatic/aliphatic pairing (Im/p, p/Im, Py/p, and p/Py) 
and aliphatic/aliphatic pairing (p/p) substitution. The use of y-aminobutyric acid, or a 
substituted y-aminobutyric acid such as (R)-2,4 diaminobutyric acid, provides for preferred 
hairpin turns. The following examples illustrate the synthesis of polyamides of the present 
invention. 
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Example 1: 

PREPARATION OF A PROTECTED Hp MONOMER FOR SOLID PHASE 

SYNTHESIS. 

Distamycin and its analogs have previously been considered targets of traditional 
5 multistep synthetic chemistry. Arcamone, F., Orezzi, P. G., Barbieri, W., Nicolella, V. & Penco, 
S. describe a solution phase synthesis of distamycin Gazz, Chim, Ital 1967, 97, 1097, The 
repeating amide of distamycin is formed from an aromatic carboxylic acid and an aromatic 
amine. The aromatic acid is often unstable to decarboxylation, and the aromatic amines have 
been found to be air and light sensitive. Lown, J. W. & Krowicki, K. describe a solution phase 

10 synthesis of Distamycin /. Org. Chem, 1985, JO, 3774. The variable coupling yields, long 
reaction times (often >24 h), numerous side products, and reactive intermediates (acid chlorides 
and trichloro ketones) characteristic of the traditional solution phase coupling reactions make 
the synthesis of the aromatic carboxamides problematic. B. Merrifield describes the solid phase 
synthesis of a tetrapeptide J. Am, Chem. Soc. 1963, 85, 2149. In order to implement an efficient 

15 solid phase methodology for the synthesis of the pyrrole- imidazole polyamides, the following 
components were developed: (1) a synthesis which provides large quantities of appropriately 
protected monomer or dimer building blocks in high purity, (2) optimized protocols for forming 
an amide in high yield from a support-bound aromatic amine and an aromatic carboxylic acid, 
(3) methods for monitoring reactions on the solid support, and (4) a stable resin linkage agent 

20 that can be cleaved in high yield upon completion of the synthesis. Baird, E. E. & Dervan, P. B. 
describes the solid phase synthesis of polyamides containing imidazole and pyrrole amino 
acids, y. Am. Chem. Soc. 118, 6141-6146 (1996); also see PCT US 97/003332. In order to 
prepare polyamides which contain the 3-hydroxypyrrole monomer, a synthesis has been 
developed which allows the appropriately protected Boc-Op acid monomer to be prepared on 50 

25 g scale. NMR and '^c NMR spectra were recorded on a General Electric-QE 300 NMR 
spectrometer in CD3OD or DMSO-t/e, with chemical shifts reported in parts per million relative 
to residual CHD2OD or DMSO-rfs, respectively. IR spectra were recorded on a Perkin-EUner 
FTIR spectrometer. High-resolution mass spectra were recorded using fast atom bombardment 
(FABMS) techniques at the Mass Spectrometry Laboratory at the University of California, 

30 Riverside. Reactions were executed under an inert argon atmosphere. Reagent grade chemicals 
were used as received unless otherwise noted. Still, W. C, Kahn, M. & Mitra, A. describe flash 
column chromatography J, Org, Chem, 1978, 40, 2923-2925. Flash chromatography was 
carried out using EM science Kieselgel 60 (230-400) mesh. Thin-layer chromatography was 
performed on EM Reagents silica gel plates (0.5 nun thickness). All compounds were 

35 visualized with short-wave ultraviolet light. 
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Table 4 :Intermediates for preparation of Boc-prote cted 3-methoxypyrrole 
NAME STRUCTURE 



Ethyl 4-carboxy-3-hydfoxy-l- 
.mediylpyrrole-2-cart)oxylate. 



Ethyl 4-[(Benzyloxycarbonyl)amino]-3- 
hydroxy-l-methylpyrrole-2-carboxylate 




Ethyl 4-[(Ben2yloxycarbonyl)amino]-3- 
methoxy-1 -methyl pyrroie-2-carboxy late 



Ethyl 4-[(tert-Butyloxycarbonyl)amino]-3- 
methoxy-l-methylpyrrole-2-carboxylate 



4-((tert-Butyloxycarbonyl)amino]-3-methoxy 
-1 -methyl pyrrole-2-carboxylic add 




Ethyl 4-[ (benzyloxycarbonyl)amino]'3-hydroxy-l'methylpyrrole'2-carboxy^ Ethyl-4- 
carboxy-3-hydroxy-l-methylpyrrole-2-carboxylate (60 g, 281.7 mmol) was dissolved in 282 
mL acetonitrile. TEA (28.53 g. 282 mmol) was added, followed by diphenylphosphorylazide 
(77.61 g, 282 mmol). The mixture was refluxed for 5 hours, followed by addition of benzyl 
alcohol (270 ml) and reflux continued overnight. The solution was cooled and volitiles 
removed in vacuo. The residue was absorbed onto silca and chromatagraphed, 4:1 hexanes : 
ethyl acetate, to give a white solid (21.58 g, 24%) NMR (DMS0-d6) 5 8.73 (s, IH), 8.31 (s, 
IH), 7.31 (m, 5H), 6.96 (s, IH), 5.08 (s, 2H), 4.21 (q, 2H, J = 7.1 Hz), 3.66 (s, 3H), 1.25 (t, 3H, 
J = 7.1 Hz); MS m/e 319.163 (M+H 319.122 calcd. for C16H18N2O5). 

Ethyl 4'f(tert'butoxycarbonyl)aminoJ'3-methoxy-I-methylpyrrole-2' carboxylate. Ethyl 
4-[(benzyloxycarbonyl)amino]-3-hydroxy-l-methylpyrrole-2-carboxylate (13.4 g, 42.3 mmol) 
was dissolved in 110 mL acetone. Anhydrous K2CO3 (11.67 g, 84.5 mmol) was added. 
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followed by methyliodide (5.96 g, 42.3 mmol) and dimethylaminopyridine (0.5 g, 4.23 mmol) 
and the mixture stirred overnight. The solid K2CO3 was removed by filtration and 200 ml 
water added. Volitiles were removed in vacuo and the solution made acidic with addition of IN 
H2SO4 . The aqueous layer was extracted with diethyl ether. Organic layers were combined, 
5 washed with 10% H2SO4, dried over MgS04, and dried to give a white solid. The solid was 
used without further purification and dissolved in 38 ml DMF. DIEA (1 1 ml), Boc anhydride 
(9.23 g, 42.3 mmol), and 10 % Pd/C (500 mg) were added and the solution stirred under 
hydrogen (1 atm) for 2.1 h. The slurry was fihered through celite which was washed with 
methanol. Water 250 ml was added and volitiles removed in vacuo. The aqueous layer was 
10 extracted with ether. Organic layers were combined, washed with water and brine, and dried 
over MgS04. Solvent was removed in vacuo to give a white solid ( 8.94 g, 71%) NMR 
(DMS0-d6) 5 8.43 (s, IH), 7.03 (s, IH), 4.19 (q. 2H, J = 7.1 Hz), 3.70 (s, 3H), 3.67 (s, 3H). 
1.42 (s. 9H), 1.26 (t, 3H, J = 7.1); MS m/e 299.161 (M+H 299.153 calcd. for C14H22N2O5). 

15 Ethyl 4-[(benzyloxycarbonyl)aminoJ'3'hydroxy-l-methylpyrrole-2-carboxylate Ethyl 4- 

[(/err-butoxycarbonyl)amino]-3-methoxy-l-methylpyrrole-2-carboxylate (9.0 g, 30.2 mmol) 
was dissolved in 30 mL ethanol. NaOH (30 ml, 1 M, aq) was added and the solution stirred for 
4 days. Water (200 ml) was added and ethanol removed in vacuo. The solution was extracted 
with diethyl ether, aqueous layer acidified to pH = 2-3, and extracted again with diethyl ether. 

20 Organic layers were dried over MgS04, and solvent removed in vacuo to give a white solid (6.0 
g, 20.5 nmiol, 87% based on recovered SM) NMR (DMSO-dg) 5 12.14 (s, IH), 8.37 (s, 
IH), 6.98 (s, IH), 3.69 (s, 3H), 3.66 (s, 3H), 1.42 (s, 9H); MS m/e 293.112 (M+H 293.104 
calcd. for C 1 2H 1 8N2O5). 

25 EXAMPLE 2: 

SOLID PHASE SYNTHESIS OF 3-HYDROXyPYRROLE POLY AMIDES. 

Cycling protocols were optimized to afford high stepwise coupling yields (>99%). 
Deprotection by aminolysis affords up to 100 mg quantities of polyamide. Solid phase 

30 polyamide synthesis protocols were modified firom the in situ neutralization Boc-chemistry 
protocols. Schnolzer, M., Alewood, P.. Jones, A., Alewood, D., Kent, S.B.H. report rapid in situ 
neutralization for solid phase peptide synthesis Int. J. Peptide. Protein. Res. 1992, 40^ 180. 
Coupling cycles are rapid, 72 min per residue for manual synthesis or 180 min per residue for 
machine-assisted synthesis, and require no special precautions beyond those used for ordinary 

35 solid phase peptide synthesis. Manual solid phase synthesis of a pyrrole-imidazole polyamide 
consists of a dichloromethane (DCM) wash, removal of the Boc group with trifluoroacetic acid 
(TFA)/DCM/thiophenol (PhSH), a DCM wash, a DMF wash, taking a resin sample for analysis, 
addition of activated monomer, addition of DIEA if necessary, coupling for 45 min, taking a 
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resin sample for analysis, and a final DMF wash (Figure 5, Table I). In addition, the manual 
solid phase protocol for synthesis of pyrrole-imidazole polyamides has been adapted for use on 
a ABI 430A peptide synthesizer. The aromatic amine of the pyrrole and imidazole do not react 
in the quantitative ninhydrin test. Stepwise cleavage of a sample of resin and analysis by HPLC 
5 indicates that high stepwise yields (> 99%) are routinely achieved, 

Dicyclohexylcarbodiimide (DCC), Hydroxybenzotriazole (HOBt), 2-(lH-Benzotriazole- 
l-yl)-l,l,3.3-tetramethyluronium hexa-fluorophosphate (HBTU) and 0.2 mmol/gram Boc-p- 
alanine-(-4.carboxamidomethyl)-benzyl-ester-copoly(styrene-divinylbenzene) resin (Boc-p- 

10 Pam-Resin) was purchased from Peptides International (0.2 mmol/gram), NovaBiochem (0.6 
mmol/gram), or Peninsula (0.6 mmol/gram). ( (/J)-2-Fmoc-4.Boc-diaminobutyric acid, (S)-2- 
Fmoc-4-Boc-diaminobutyric acid, and (/?)-2-amino-4-Boc-diaminobutyric acid were purchased 
fi-om Bachem. MAT-diisopropylethylamine (DIEA), ;V,A^-dimethylformamide (DMF), N- 
methylpyrrolidone (NMP), DMSO/NMP, Acetic anhydride (AC2O), and 0.0002 M potassium 

15 cyanide/pyridine were purchased from Applied Biosystems. Dichloromethane (DCM) and 
triethylamine (TEA) were reagent grade from EM, thiophenol (PhSH), 
dimethylaminopropylamine (Dp). Sodium Hydride, (i?)-a-methoxy-a- 
(trifuoromethyl)phenylacetic acid ((;?)MPTA) and (5)-a-methoxy-a- 
(trifouromethyl)phenylacetic acid ((5)MPTA) were from Aldrich, trifluoroacetic acid (TFA) 

20 Biograde from Halocarbon, phenol from Fisher, and ninhydrin from Pierce. All reagents were 
used without fiirther purification. 

Quik-Sep polypropylene disposable filters were purchased from Isolab Inc, *H NMR 
spectra were recorded on a General Electric-QE NMR spectrometer at 300 MHz with chemical 

25 shifts reported in parts per million relative to residual solvent. UV spectra were measured in 
water on a Hewlett-Packard Model 8452 A diode array spectrophotometer., Optical rotations 
were recorded on a JASCO Dip 1000 Digital Polarimeter. Matrix-assisted, laser 
desorption/ionization time of flight mass spectrometry (MALDI-TOF) was performed at the 
Protein and Peptide Microanalytical Facility at the Califomia Institute of Technology. HPLC 

30 analysis was performed on either a HP 1090M analytical HPLC or a Beckman Gold system 
using a RAINEN Cjg, Microsorb MV, 5^m, 300 x 4.6 mm reversed phase column in 0.1% 
(wt/v) TFA with acetonitrile as eluent and a flow rate of 1,0 mL/min, gradient elution 1.25% 
acetonitrile/min. Preparatory reverse phase HPLC was performed on a Beckman HPLC with a 
Waters DeltaPak 25 x 100 mm, 100 ^m C18 column equipped with a guard, 0.1% (wt/v) TFA, 

35 0.25% acetonitrile/min. 18MQ water was obtained from a Millipore MilliQ water purification 
system, and all buffers were 0.2 ^m filtered. 
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Activati n f Boc-3-meth xypyrrole acid. The amino acid (0.5 mmol) was dissolved in 2 mL 
DMF. HBTU (190 mg, 0.5 mmol) was added followed by DIEA (1 mL) and the resulting 
mixture was shaken for 5 min. 

5 Activation of Imidazole-2-carboxylic acid, y-aminobutyric acid, Boc-giycine, and Boc-p- 
alanine. The appropriate amino acid or acid (2 mmol) was dissolved in 2 mL DMF. HBTU 
(720 mg, 1.9 mmol) was added followed by DIEA (1 mL) and the solution shaken for at least 5 
min. 

10 Activation of Boc-Imidazole acid. Hoc imidazole acid (257 mg, 1 mmol) and HOBt (135 mg, 
1 mmol) were dissolved in 2 mL DMF, DCC (202 mg, 1 mmol) is then added and the solution 
allowed to stand for at least 5 min. 

Acetylation Mix. 2 mL DMF. DIEA (710 nL, 4.0 mmol), and acetic anhydride (380 \iL, 4.0 
15 mmol) were combined immediately before use. 

Manual Synthesis Protocol. Boc-B-alanine-Pam-Resin (1.25 g, 0.25 nunol) is placed in a 20 
mL glass reaction vessel, shaken in DMF for 5 min and the reaction vessel drained. The resin 
was washed with DCM (2 x 30 s.) and the Boc group removed with 80% TFA/DCM/0.5M 

20 PhSH, 1 X 30s., 1 X 20 min The resin was washed with DCM (2 x 30 s.) followed by DMF (1 x 
30 s.) A resin sample (5-10 mg) was taken for analysis. The vessel was drained completely and 
activated monomer added, followed by DIEA if necessary. The reaction vessel was shaken 
vigorously to make a slurry. The coupling was allowed to proceed for 90 min, and a resin 
sample taken. Acetic anhydride (1 mL) was added and the reaction shaken for 5 min. The 

25 reaction vessel was then washed with DMF, followed by DCM. 

Machine-Assisted Protocols. Machine-assisted synthesis was performed on a ABI 430A 
synthesizer on a 0.18 mmol scale (900 mg resin; 0.2 mmoiygram). Each cycle of amino acid 
addition involved: deprotection with approximately 80% TFA/DCM/0.4M PhSH for 3 minutes, 

30 draining the reaction vessel, and then deprotection for 1 7 minutes; 2 dichloromethane flow 
washes; an ^MP flow wash; draining the reaction vessel; coupling for 1 hour with in situ 
neutralization, addition of dimethyl sulfoxide (DMSO)/NMP. coupling for 30 minutes, addition 
of DIEA, coupling for 30 minutes; draining the reaction vessel; washing with DCM, taking a 
resin sample for evaluation of the progress of the synthesis by HPLC analysis; capping with 

35 acetic anhydride/DIEA in DCM for 6 minutes; and washing with DCM. A double couple cycle 
is employed when coupling aliphatic amino acids to imidazole, all other couplings are 
performed with single couple cycles. 
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The ABI 430A synthesizer was left in the standard hardware configuration for NMP- 
HOBt protocols. Reagent positions 1 and 7 were DIEA, reagent position 2 was TFA/0.5M 
thiophenol, reagent position 3 was 70% ethanolamine/methanol, reagent position 4 was acetic 
anhydride, reagent position 5 was DMSO/NMP, reagent position 6 was methanol, and reagent 
5 position 8 was DMF: New activator functions were written, one for direct transfer of the 
cartridge contents to the concentrator (switch list 21, 25, 26, 35, 37, 44), and a second for 
transfer of reagent position 8 directly to the cartridge (switch list 37, 39, 45, 46). 

Boc-Py-OBt ester (357 mg, 1 mmol) was dissolved in 2 ml DMF and filtered into a 
10 synthesis cartridge. Boc-Im acid monomer was activated (DCC/HOBt), filtered, and placed in a 
synthesis cartridge. Imidazole-2-carboxylic acid was added manually. At the initiation of the 
coupling cycle the synthesis was interrupted, the reaction vessel vented and the activated 
monomer added directly to the reaction vessel through the resin sampling loop via syringe. 
When manual addition was necessary an empty synthesis cartridge was used. Aliphatic amino 
15 acids (2 mmol) and HBTU (1.9 mmol) were placed in a synthesis cartridge. 3 ml of DMF was 
added using a calibrated delivery loop from reagent bottle 8, followed by calibrated delivery of 
1 ml DIEA from reagent bottle 7, and a 3 minute mixing of the cartridge. 

The activator cycle was written to transfer activated monomer directly from the cartridge to 
20 the concentrator vessel, bypassing the activator vessel. After transfer, 1 ml of DIEA was 
measured into the cartridge using a calibrated delivery loop, and the DIEA solution combined 
with the activated monomer solution in the concentrator vessel. The activated ester in 2:1 
DMF/DIEA was then transferred to the reaction vessel. All lines were emptied with argon - 
before and after solution transfers. 

25 

ImlmOpPy-y-ImPyPyPy-^'Dp. ImlmOpPy-y-ImPyPyPy-P-Pam-Resin was synthesized 
in a stepwise fashion by machine-assisted solid phase methods from Boc-P-Pam-Resin (0.66 
nmiol/g). Baird, E. E. & Dervan, P. B. describes the soHd phase synthesis of polyamides 
containing imidazole and pyrrole amino acids. J, Am. Chem, Soc. 118, 6141-6146 (1996); also 

30 see PCT US 97/003332. 3-hydroxypyrrole-Boc-amino acid (0.7 mmol) was incorporated by 
placing the amino acid (0.5 mmol) and HBTU (0.5 mmol) in a machine synthesis cartridge. 
Upon automated delivery of DMF (2 mL) and DIEA (1 mL) activation occurs. A sample of 
ImlmOpPy-y-ImPyPyPy-p-Pam-Resin (400 mg, 0.40 mmol/gram) was placed in a glass 20 mL 
peptide synthesis vessel and treated with neat dimethylaminopropylamine (2 mL) and heated 

35 (55 °C) with periodic agitation for 16 h. The reaction mixture was then filtered to remove resin, 
0.1% (wt/v) TFA added (6 mL) and the resulting solution purified by reversed phase HPLC. 
ImlmOpPy-y-ImPyPyPy-P-Dp is recovered upon lyophilization of the appropriate fractions as a 
white powder (97 mg, 49% recovery). UV (H2O) 246, 316 (66,000); NMR (DMSO-rf^) 



23 



WO98/37066 



PCT/US98/01006 



5 10.24 (s, 1 H), 10.14 (s, 1 H), 9.99 (s, 1 H), 9,94 (s, 1 H), 9.88 (s, 1 H), 9.4 (br s, 1 H), 9.25 (s, 
1 H), 9.1 1 (s, 1 H), 8.05 (m, 3 H), 7.60 (s, 1 H), 7.46 (s, 1 H), 7.41 (s, 1 H), 7.23 (d, 1), 7.21 (d, 
1 H), 7.19 (d. 1 H), 7.13 (m, 2 H), 7.11 (m, 2 H), 7.02 (d, 1 H), 6.83 (m, 2 H), 3.96 (s, 6 H), 
3.90 (s, 3 H), 3.81 (m, 6 H), 3.79 (s, 3 H), 3.75 (d, 9 H), 3.33 (q, 2 H, 7 = 5.4 Hz), 3.15 (q, 2 H, 
5 y = 5.5 Hz), 3.08 (q, 2 H, /= 6.0 Hz). 2.96 (quintet, 2 H, 7= 5.6 Hz), 2.70 (d, 6 H, 7 = 4.5 Hz), 
2.32 (m, 4 H), 1.71 (m, 4 H); MALDI-TOF-MS (monoisotopic), 1253.5 (1253.6 calc. for 
C58H72N22OU). 

ImlmMpPy-y-ImPyPyPy. In order to remove the methoxy protecting group, a sample of 

10 ImlmOpPy-Y-ImPyPyPy-p-Dp (5 mg, 3.9 ^mol) was treated with sodium thiophenoxide at 100 
°C for 2 h. DMF (1000 ^L) and thiophenol (500 ^iL) were placed in a (13 x 100 mm) disposable 
Pyrex screw cap culture tube. A 60 % dispersion of sodium hydride in mineral oil (100 mg) was 
slowly added. Upon completion of the addition of the sodium hydride, ImlmOpPy-y-ImPyPyPy- 
P-Dp (5 mg) dissolved in DMF (500 nL) was added. The solution was agitated, and placed in a 

15 100 *=*C heat block, and deprotected for 2 h. Upon completion of the reaction the culture tube 
was cooled to 0*^0, and 7 ml of a 20 % (wt/v) solution of trifluoroacetic acid added. The 
aqueous layer is separated from the resulting biphasic solution and purified by reversed phase 
HPLC. ImlmHpPy-y-ImPyPyPy-p-Dp is recovered as a white powder upon lyophilization of 
the appropriate fractions (3.8 mg, 77 % recovery). UV (HjO) 246, 312 (66,000); *H NMR 

20 (DMSO-de) 5 10.34 (s, 1 H), 10.24 (s, 1 H), 10.00 (s, 2 H), 9.93 (s, 1 H), 9.87 (s, 1 H), 9.83 (s. 
1 H), 9.4 (br s, 1 H). 9.04 (s. 1 H), 8.03 (m, 3 H), 7.58 (s, 1 H), 7.44 (s, 1 H), 7.42 (s, 1 H), 7.23 
(s, 1 H), 7.20 (m, 3 H), 7.12 (m, 2 H). 7.05 (d, 1 H), 7.02 (d, 1 H), 6.83 (s, 1 H). 6.79 (s, 1 H), 
3.96 (s. 6 H), 3.90 (s, 3 H), 3.81 (s, 6 H), 3.79 (s, 3 H), 3.75 (d, 6 H), 3.33 (q, 2 H. 7 = 5.4 Hz), 
3.14 (q, 2 H, y = 5.4 Hz), 3.08 (q, 2 H, 7= 6.1 Hz), 2.99 (quintet, 2 H, 7 = 5.4 Hz), 2.69 (d, 6 H, 

25 J= 4.2 Hz), 2.31 (m, 4 H), 1.72 (m, 4 H); MALDI-TOF-MS (monoisotopic), 1239.6 (1239.6 
calc. for C57H71N22O1,). 

ImlmPyPy-y-ImOpPyPy'^-Dp, ImlmPyPy-y-ImOpPyPy-p-Pam-Resin was synthesized 
in a stepwise fashion by machine-assisted solid phase methods from Boc-P-Pam-Resin (0.66 

30 mmol/g) as described for ImlmOpPy-y-ImPyPyPy-P-Dp. A sample of ImlmPyPy-y-ImOpPyPy- 
p-Pam-Resin (400 mg. 0.40 mmol/gram) was placed in a glass 20 mL peptide synthesis vessel 
and treated with neat dimethylaminopropylamine (2 mL) and heated (55 °C) with periodic 
agitation for 16 h. The reaction mixture was then filtered to remove resin, 0.1% (wt/v) TFA 
added (6 mL) and the resuhing solution purified by reversed phase HPLC. ImlmPyPy-y- 

35 ImOpPyPy-p-Dp is recovered upon lyophilization of the appropriate fractions as a white 
powder (101 mg, 50% recovery). UV (HjO) 246, 316 (66,000); MALDI-TOF-MS 
(monoisotopic), 1253.6 (1253.6 calc. for C58H72N220n). 
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ImlmPyPy-y-ImHpPyPy. A sample of ImlmPyPy-Y-ImOpPyPy-P-Dp (5 mg, 3.9 ^imol) 
was treated with sodium thiophenoxide and purified by reversed phase HPLC as described for 
ImImHpPy-7-ImPyPyPy.p-Dp. ImlmPyPy-y.ImHpPyPy-p-Dp is recovered upon lyophilization 
of the appropriate fractions as a white powder (3.2 mg, 66 % recovery). UV (H2O) }^ 246, 
5 312 (66.000); MALDI-TOF-MS (monoisotopic), 1239.6 (1239.6 calc. for C57H71N22O,,). 

ImPyPy-y-OpPyPy-^-Dp. ImPyPy-y-OpPyPy-p.Pam-Resin was synthesized in a 
stepwise fashion by machine-assisted solid phase methods fi-om Boc-p-Pam-Resin (0.66 
mmol/g). Baird, E. E. & Dervan, P. B. describes the solid phase synthesis of polyamides 

10 containing imidazole and pyrrole amino acids. 1 Am. Chem, Soc, 118, 6141-6146 (1996); also 
see PCT US 97/003332. 3-hydroxypyrrole-Boc-amino acid (0.7 mmol) was incorporated by 
placing the amino acid (0.5 nunol) and HBTU (0.5 mmol) in a machine synthesis cartridge. 
Upon automated delivery of DMF (2 mL) and DIEA (1 mL) activation occurs. A sample of 
ImPyPy-y-OpPyPy-p-Pam-Resin (400 mg, 0.45 mmol/gram) was placed in a glass 20 mL 

15 peptide synthesis vessel and treated with neat dimethylaminopropylamine (2 mL) and heated 
(55 **C) with periodic agitation for 16 h. The reaction mixture was then filtered to remove resin, 
0.1% (wt/v) TFA added (6 mL) and the resuUing solution purified by reversed phase HPLC. 
ImPyPy-y-OpPyPy-p-Dp is recovered upon lyophilization of the appropriate firactions as a 
white powder (45 mg, 25% recovery). UV (H2O) 246. 310 (50,000); *H NMR (DMSO-de) 

20 8 10.45 (s, 1 H), 9.90 (s, 1 H), 9.82 (s, 1 H), 9.5 (br s, 1 H), 9.38 (s, 1 H), 9.04 (s, 1 H), 8.02 (m, 
3 H), 7.37 (s, 1 H), 7.25 (m, 2 H), 7.15 (d, 1 H, /= 1.6 Hz), 7.1 1 (m, 2 H), 7.09 (d, 1 H), 7.03 
(d, 1 H), 6.99 (d, 1 H), 6.87 (d, 1 H), 6.84 (d, 1 H), 3.96 (s, 3 H), 3.81 (s, 6 H), 3.77 (s, 6 H), 
3.76 (s, 3 H), 3.74 (s, 1 H), 3.34 (q, 2 H, J = 5.6 Hz), 3.20 (q, 2 H, 7- 5.8 Hz), 3.09 (q. 2 H, J = 
6.1 Hz), 2.97 (quintet, 2 H,y = 5.3 Hz), 2.70 (d, 6 H, 7= 3.9 Hz), 2.34 (m, 4 H), 1.73 (m, 4 H); 

25 MALDI-TOF-MS (monoisotopic), 1007.6 (1007.5 calc. for C48H63N,609). 

ImPyPy-y-HpPyPy, In order to remove the methoxy protecting group, a sample of 
ImPyPy-y-OpPyPy-p-Dp (5 mg, 4.8 nmol) was treated with sodium thiophenoxide at 100 °C 
for 2 h. DMF (1000 ^L) and thiophenol (500 ^iL) were placed in a (13 x 100 mm) disposable 

30 Pyrex screw cap culture tube. A 60 % dispersion of sodium hydride in mineral oil (100 mg) was 
slowly added. Upon completion of the addition of the sodium hydride, ImlmPyPy-y-ImOpPyPy- 
p-Dp (5 mg) dissolved in DMF (500 jiL) was added. The solution was agitated, and placed in a 
100 °C heat block, and deprotected for 2 h. Upon completion of the reaction the culture tube 
was cooled to O^'C, and 7 ml of a 20 % (wt/v) solution of trifluoroacetic acid added. The 

35 aqueous layer is separated fi-om the resulting biphasic solution and purified by reversed phase 
HPLC. ImlmHpPy-y-ImHpPyPy-p-Dp is recovered as a white powder upon lyophilization of 
the appropriate fi-actions (2.5 mg, 52 % recovery). UV (H2O) 246, 310 (50,000); ^H NMR 
(DMSO-J5) 5 10.44 (s, 1 H), 10.16 (s, 1 H), 9.90 (s, 1 H), 9.77 (s, 1 H), 9.5 (br s, 1 H), 9.00 (s. 
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1 H), 8.03 (m, 3 H), 7.37 (s, 1 H), 7.26 (m, 2 H), 7.14 (d, \H,J= 1.7 Hz). 7.12 (m, 2 H), 7.02 
(d, 1 H), 6.93 (d, 1 H), 6.88 (d, 1 H), 6.82 (d, 1 H), 6.72 (d. 1 H), 3.96 (s, 3 H), 3.81 (s, 6 H), 
3.77 (s, 3 H), 3.76 (s, 3 H), 3.74 (s, I H). 3.36 (q, 2 H, J= 5.4 Hz), 3.22 (q, 2 H, 7 = 5.9 Hz), 
3.09 (q, 2 H, J = 5.5 Hz), 2.98 (quintet, 2 H, 7 = 5.3 Hz), 2.70 (d, 6 H, 7 = 4.3 Hz), 2.34 (m, 4 
H), 1.78 (m, 4 H); MALDI-TOF-MS (monoisotopic), 994.2 (993.5 calc. for C47H6iN,609). 

Table 5. Mass spectral characterization of Op and Hp polyamides, synthesized and purified as 
described for ImlmOpPy-Y-ImPyPyPy-P-Dp and ImlmHpPy-Y-ImPyPyPy-P-Dp. 



POT YAMiniP 








ImOpPy-y-PyPyPy-p"Dp 


C48H63N16O9 


1007.5 


1007.5 


ImHpPy-Y-PyPyPy-P-Dp 


C47H6,N,609 


993.5 


993.2 


ImPyOp-y-PyPyPy-P-Dp 


C48H63N,609 


1007.5 


1007.5 


ImPyHp-y-PyPyPy-p-Dp 


C47H61N16O9 


993.5 


993.4 


ImPyPy-y-OpPyPy-p-Dp 


C48H63N,609 


1007.5 


1007.6 


ImPyPy-y-HpPyPy-P-Dp 


C47H6,N,609 


993.5 


993.2 


ImPyPy-Y-PyOpPy-P-Dp 


C48H63N,609 


1007.5 


1007.5 


ImPyPy-y-PyHpPy-P-Dp 


C47H6,N,609 


993.5 


993.4 


ImOpOp-y-PyPyPy-p-Dp 


C49H65NUO10 


1037.5 


1037.5 


ImHpHp-y-PyPyPy-p-Dp 


C47H6tNi6Oi0 


1009.5 


1009.4 


ImlmOpPy-y-ImPyPyPy-p-Dp 


CS8H72N22O,, 


1253.6 


1253.5 


ImlmHpPy-y-ImPyPyPy-P-Dp 


Cs7H7,N220„ 


1239.6 


1239.6 


ImlmPyPy-y-ImOpPyPy-p-Dp 


Cs8H72N220n 


1253.6 


1253.6 


ImlmPyPy-Y-ImHpPyPy-P-Dp 


Cs7H7,N220u 


1239.6 


1239.6 


ImOpPyPy-y-ImOpPyPy-P-Dp 


C6oH76N2,Oi2 


1282.6 


1282.6 


ImHpPyPy-y-ImHpPyPy-p-Dp 


C5gH72N2,Oi2 


1254.6 


1254.6 


ImlmOpPy-y-ImOpPyPy-P-Dp 


C59H75N220,2 


1283.6 


1283.6 


ImlmHpPy-y-ImHpPyPy-P-Dp 


C57H7,N220,2 


1255.6 


1255.5 


ImOpPyPy-y-PyPyPyPy-P-Dp 


C60H75N20OH 


1251.6 


1251.5 


ImPyPyPy-y-PyPyOpPy-p-Dp 


C60H75N20O1 1 


1251.6 


1251.5 


ImlmPyPy-y-ImPyOpPy-P-Dp 


C58H72N22O1, 


1253.6 


1253.7 


ImOpPyPyPy-y-ImOpPyPyPy-P-Dp 


C72H88N250,4 


1526.7 


1526.6 


ImHpPyPyPy-y-ImHpPyPyPy-P-Dp 


C70H84N25O,4 


1498.7 


1498.0 


ImlmPyPyPy-y-ImOpOpPyPy-P-Dp 


C7lH87N260,4 


1527.7 


1527.7 
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EXAMPLE 3: 
DETERMINATION OF POLY AMIDE 
BINDING AFFINITY AND SEQUENCE SPECIFICITY. 

5 Representative- footprint titration experiments are shown in Figures 3 and 10. A 252-bp 

DNA fragment which is typically used for the footprint titration experiments provides 247 
possible 6-bp binding sites for an eight-ring hairpin poiyamide. Thus, in addition to providing 
DNA binding affinities, the footprint titration experiments also reveal DNA binding sequence- 
specificity. The DNA binding sequence specificity of polyamides which differ by a single 

10 Py/Py, Hp/Py, or Py/Hp pair for sites which differ by a single A*T or T*A base pair are 
described in Tables 1, 6, and 7. 

Quantitative DNase /Footprint Titrations All reactions were executed in a total volume 
of 400 ^iL (Brenowitz, M. et aL, 1986). A poiyamide stock solution or H2O (for reference 

15 lanes) was added to an assay buffer containing 3'-^^? radiolabeled restriction fragment (20,000 
cpm), affording final solution conditions of 10 mM Tris»HCl. 10 mM KCl, 10 mM MgCh, 5 
mM CaCl2, pH 7.0, and either (i) a suitable concentration range of poiyamide, or (ii) no 
poiyamide (for reference lanes). The solutions were allowed to equilibrate for 24 hours at 22**C. 
Footprinting reactions were initiated by the addition of 10 |iL of a stock solution of DNase I (at 

20 the appropriate concentration to give --55% intact DNA) containing 1 mM dithiothreitol and 
allowed to proceed for 7 minutes at 22**C. The reactions were stopped by the addition of 50 ^L 
of a solution containing 2.25 M NaCl, 150 mM EDTA, 23 \xM base pair calf thymus DNA, and 
0.6 mg/ml glycogen, and ethanol precipitated. The reactions were resuspended in 1 x TBE/ 
80% formamide loading buffer, denatured by heating at 85°C for 15 minutes, and placed on ice. 

25 The reaction products were separated by electrophoresis on an 8% polyacrylamide gel (5% 
crosslinking, 7 M urea) in 1 x TBE at 2000 V for 1.5 h. Gels were dried on a slab dryer and 
then exposed to a storage phosphor screen at 22°C. 

Photostimuable storage phosphor imaging plates (Kodak Storage Phosphor Screen 
30 SO230 obtained from Molecular Dynamics) were pressed flat against dried gel samples and 
exposed in the daric at 22*C for 12-24 hours. A Molecular Dynamics 400S Phosphorlmager 
was used to obtain all data from the storage screens (Johnston et al., 1990). The data were 
analyzed by performing volume integration of the target site and reference blocks using the 
ImageQuant v. 3.3 software running on a Compaq Pentium 80. 

35 

Quantitative DNase I Footprint Titration Data Analysis was performed by taking a 
background-corrected volume integration of rectangles encompassing the footprint sites and a 
reference site at which DNase I reactivity was invariant across the titration generated values for 
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10 



15 



the site intensities (I^itc) and the reference intensity (I„f), The ^parent fractional occupancy 
(6app) of the sites were calculated using the equation: 



/•if«//r^o 



(1) 



where Ijue*^ and Inf" are the site and reference intensities, respectively, from a DNase I control 
lane to which no polyamide was added. 

The ([L]tot, 9app) data were fit to a Langmuir binding isotherm (eq, 2, n=l) by 
minimizing the difference between Gapp and Gfit, using the modified Hill equation: 

Ka" [L]" tot 



9m 



Gmin + (Gmax - Gmin ) 



(2) 



1 + K.^ILfio, 

where [Lk,,] is the total polyamide concentration, is the equilibrium association constant, and 
Gmin and 6 

max are the experimentally determined site saturation values when the site is 
unoccupied or saturated, respectively. The data were fit using a nonlinear least-squares fitting 
procedure of KaleidaGraph software (v. 3.0.1, Abelbeck Software) with K^, 6^, and Q^^ as the 
adjustable parameters. The goodness of fit of the binding curve to the data points is evaluated 
by the correlation coefficient, with R > 0.97 as the criterion for an acceptable fit. Four sets of 
acceptable data were used in determining each association constant. All lanes from a gel were 
used unless a visual inspection revealed a data point to be obviously flawed relative to 
neighboring points. The data were normalized using the following equation: 

B>PP " 9mrn 

6 norm = 



dmax - Gnr 



20 



TABLE 6 Discrimination of y-TGTAA-y and S'-TOTTA-y'' 
Pair- 5'-TGTAA-y 5^TGTTA-3- ^rri^ 



Py/Py 



5'-T G T 



3'-A C A 



A-3' 



T-5' 



5'-T G T _ _ 

fK><>o|OK 

3'-A C A 



A-3' 



T-5' 



^Tj =0.014 ^M 



= 0.001 (iM 



2.0 



5'-T G T A A-3' 5'-T G Tf»lA-3' 

Py/Hp ,t^KX>®^ -eKKX>S^ 0.36 



Hp/Py 



5'-T G T\K 




3'-A C aIt 



A-3' 5'-T G TfT]A-3' 
T-5* 3'-A C aIa|t-5' 



14 



*The reported equilibriiun dissociaticn constants are the mean values 
obtained from two DNase I footprint titration experiments on a 3' ^P 
labeled 370-bp pDEHl EcoRl/PouII DNA restriction fragment". The 
assays were carried out at 22 *C pH 7.0 in the presence of 10 mM 
Tris-HQ, 10 mM KCl 10 mM MgCla, and 5 mM CaClj. 
tRing pairing opposite T»A and A*T in the third position. 
^Calculated as X.(5'-TGTAA.3')/K.(5'-TGTrA-3'). 
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TABLE 7 Discrimination of S'-TGITT-y and 5'-TGATT-3'* 
Pairt y-TGATT-y y>TGTTr-y O" 



Py/Py 



5*-T G 




3'-A C 




0.005 »iM 



5^ 



Hp/Py 



T T-3' 




X'^ = 0.53jiM 



5'-T G[1|t T-3' 
• (SO^ 

3'-A CygA A-5* 
ATj^ 0.008 



Py/Hp 



T T-3' 



5'-T G 

3'-A ClilA A- 
^d = 0.33nM 



5' 




0,56 



"The reported equilibrium dissociation constants are the mean values 
obtained from two DNase I footprint titration experiments. The assays 
were carried out at 22 pH 7.0 in the presence of 10 mM Tris»HCl 
10 mM KCl, 10 mM Mga2, and 5 mM CaOo. 
TRing pairing opposite T«A and A-T in the mird position. 
^Calculated as K^iS'-TCATT -3')/Kd(5'.TGTTT.3'). 



EXAMPLE 5: 

5 PREPARATION OF A BIFUNCTIONAL Hp-Py-Im-POLYAMIDE. 

ImImOpPy-y-rmPyPyPy-^-Dp-NH2. ImlmOpPy-y-ImPyPyPy-P-Pam-Resin was 
synthesized in a stepwise fashion by machine-assisted solid phase methods from Boc-p-Pam- 
Resin (0.66 mmol/g). Baird, E. E. & Dervan, P. B. describes the solid phase synthesis of 

10 polyamides containing imidazole and pyrrole amino acids. J. Am. Chem, Soc. 118, 6141-6146 
(1996); also see PCT US 97/003332. 3-hydroxypyiTole-Boc-amino acid (0.7 mmol) was 
incorporated by placing the amino acid (0.5 mmol) and HBTU (0.5 mmol) in a machine 
synthesis cartridge. Upon automated delivery of DMF (2 mL) and DIEA (1 mL) activation 
occurs. A sample of ImlmOpPy-y-ImPyPyPy-p-Pam-Resin (400 mg, 0.40 mmol/gram) was 

15 placed in a glass 20 mL peptide synthesis vessel and treated with neat 3,3'-diamino-Ar- 
methyldipropylamine (2 mL) and heated (55 °C) with periodic agitation for 16 h. The reaction 
mixture was then filtered to remove resin, 0.1% (wt/v) TFA added (6 mL) and the resuhing 
solution purified by reversed phase HPLC. ImImOpPy.Y-ImPyPyPy-p-Dp-NH2 is recovered 
upon lyophilization of the appropriate fractions as a white powder (93 mg, 46% recovery). UV 

20 (H2O) Xmax 246, 316 (66,000); NMR (DMSO-rf^) 5 10.34 (s, 1 H), 10.30 (br s, 1 H), 10.25 
(s, 1 H), 9.96 (s, 1 H), 9.95 (s, 1 H), 9.89 (s, 1 H), 9.24 (s, 1 H), 9.1 1 (s, 1 H), 8.08 (t, 1 H, 
5.6 Hz), 8.0 (m, 5 H), 7.62 (s, 1 H), 7.53 (s, 1 H), 7.42 (s, 1 H), 7.23 (d. IH, J - 1.2 Hz), 7.21 
(m, 2 H), 7.15 (m, 2 H), 7.13 (d, 1 H), 7.1 1 (m, 2 H), 7.04 (d, 1 H), 6.84 (m, 3 H). 3.98 (s. 3 H), 
3.97 (s. 3 H), 3.92 (s, 3 H), 3.82 (m, 6 H), 3.80 (s, 3 H), 3.77 (d, 6 H), 3.35 (q, 2 H, J= 5.8 Hz) 

25 3.0-3.3 (m, 8 H), 2.86 (q, 2 H, 7 = 5.4 Hz). 2.66 (d, 3 H, 7 = 4.5 Hz), 2.31 (m, 4 H), 1.94 
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(quintet, 2H,J= 6.2 Hz), 1.74 (m, 4 H); MALDI-TOF-MS (monoisotopic). 1296.0 (1296.6 
calc. forC60H78N23Oii). 

ImlmOpPy-y-ImPyPyPy-^-Dp-EDTA. Excess EDTA-dianhydride (50 mg) was dissolved 
5 in DMSO/NMP (1 mL) and DIEA (1 mL) by heating at 55 "C for 5 min. The dianhydride 
solution was added to ImImOpPy-y-ImPyPyPy.p-NH2 (13 mg, 10 jimol) dissolved in DMSO 
(750 ^L). The mixture was heated (55 °C, 25 min.) and the remaining EDTA-anhydride 
hydrolyzed (O.IM NaOH, 3 mL, 55 "C, 10 min). Aqueous TFA (0.1% wt/v) was added to 
adjust the total volume to 8 mL and the solution purified directly by reversed phase HPLC to 
10 provide ImlmOpPy-v-ImPyPyPy-p-Dp-EDTA as a white powder upon lyophilization of the 
appropriate fractions (5.5 mg, 40% recovery). MALDI-TOF-MS (monoisotopic), 1570.9 
(1570.7 calc. for C70H92N25O18). 

ImlmHpPy-y.ImPyPyPy-^-Dp-EDTA. In order to remove the methoxy protecting group, 
15 a sample of ImlmOpPy-y-ImPyPyPy-P-Dp-EDTA (5 mg, 3. 1 jimol) was treated with sodium 
thiophenoxide at 100 °C for 2 h. DMF (1000 jiL) and thiophenol (500 jiL) were placed in a (13 
x 100 mm) disposable Pyrex screw cap culnire tube. A 60 % dispersion of sodium hydride in 
mineral oil (100 mg) was slowly added. Upon completion of the addition of the sodium hydride, 
ImlmOpPy-y-ImPyPyPy-p-Dp-EDTA (5 mg) dissolved in DMF (500 jtL) was added. The 
20 solution was agitated, and placed in a 100 "C heat block, and deprotected for 2 h. Upon 
completion of the reaction the culture tube was cooled to 0°C, and 7 ml of a 20 % (wt/v) 
solution of trifluoroacetic acid added. The aqueous layer is separated from the resulting biphasic 
solution and purified by reversed phase HPLC. ImlmHpPy-y-ImPyPyPy-P-Dp-EDTA is 
recovered as a white powder upon lyophilization of the appropriate fractions (3.2 mg, 72 % 
25 recovery). UV (H2O) Xmax 246, 312 (66,000); MALDI-TOF-MS (monoisotopic), 1556.6 
(1556.7 calc. for C69H90N25O18). 

ImImPyPy-y-ImOpPyPy-^-Dp-NH2. ImlmPyPy-y-ImOpPyPy-p-Pam-Resin was 
synthesized in a stepwise fashion by machine-assisted solid phase methods from Boc-P-Pam- 

30 Resin (0.66 mmol/g). Baird, E. E. & Dervan, P. B. describes the solid phase synthesis of 
polyamides containing imidazole and pyrrole amino acids. J. Am. Chem. Soc. 118, 6141-6146 
(1996); also see PCT US 97/003332. 3-hydroxypyrrole-Boc-amino acid (0.7 ramol) was 
incorporated by placing the amino acid (0.5 mmol) and HBTU (0.5 mmol) in a machine 
synthesis cartridge. Upon automated delivery of DMF (2 mL) and DIEA (1 mL) activation 

35 occurs. A sample of ImlmPyPy-y-ImOpPyPy-p-Pam-Resin (400 mg, 0.40 ramol/gram) was 
placed in a glass 20 mL peptide synthesis vessel and treated with neat 3,3'-diamino-//- 
methyldipropylamine (2 mL) and heated (55 X) with periodic agitation for 16 h. The reaction 
mixture was then filtered to remove resin, 0.1% (wt/v) TFA added (6 mL) and the resulting 
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solution purified by reversed phase HPLC. ImIniPyPy-y-ImOpPyPy.|l-Dp-NH2 is recovered 
upon lyophilization of the appropriate fractions as a white powder (104 mg, 54% recovery). UV 
(H2O) kmax 246, 316 (66,000); MALDI-TOF-MS (monoisotopic). 1296.6 (1296.6 calc. for 
C6OH78N23O11). 

5 

ImlmPyPy-y-ImOpPyPy-^-Dp-EDTA. Excess EDTA-dianhydride (50 mg) was dissolved 
in DMSO/NMP (1 mL) and DIEA (1 mL) by heating at 55 for 5 min. The dianhydride 
solution was added to IniImPyPy-y-ImOpPyPy-P-NH2 (13 mg, 10 ^imol) dissolved in DMSO 
(750 nL). The mixture was heated (55 "^C, 25 min.) and the remaining EDTA-anhydride 
10 hydrolyzed (O.IM NaOH, 3 mL, 55 **C, 10 min). Aqueous TFA (0.1% wt/v) was added to 
adjust the total volume to 8 mL and the solution purified directly by reversed phase HPLC to 
provide ImlmPyPy-y-ImOpPyPy-p-Dp-EDTA as a white powder upon lyophilization of the 
appropriate fractions (5.9 mg, 42% recovery). MALDI-TOF-MS (monoisotopic), 1570.8 
(1 570.7 calc. for C70H92N25O1 g). 

15 

ImlmPyPy'y'ImHpPyPy-^-Dp'EDTA, In order to remove the methoxy protecting group, 
a sample of ImlmPyPy-y-ImOpPyPy-P-Dp-EDTA (5 mg, 3.1 ^mol) was treated with sodium 
thiophenoxide at 100 °C for 2 h. DMF (1000 nL) and thiophenol (500 \iL) were placed in a (13 
X 100 nmi) disposable Pyrex screw cap culture tube. A 60 % dispersion of sodium hydride in 

20 mineral oil (100 mg) was j/ow(y added. Upon completion of the addition of the sodium hydride, 
ImlmPyPy-y-ImOpPyPy-P-Dp-EDTA (5 mg) dissolved in DMF (500 fiL) was added. The 
solution was agitated, and placed in a 100 °C heat block, and deprotected for 2 h. Upon 
completion of the reaction the culture tube was cooled to O'^C, and 7 ml of a 20 % (wt/v) 
solution of trifluoroacetic acid added. The aqueous layer is separated from the resulting biphasic 

25 solution and purified by reversed phase HPLC. ImlmPyPy-y-lmHpPyPy-P-Dp-EDTA is 
recovered as a white powder upon lyophilization of the appropriate fractions (3.2 mg, 72 % 
recovery). UV (H2O) Xmax 246, 312 (66,000); MALDI-TOF-MS (monoisotopic), 1555.9 
(1556.7 calc. for C69H90N25O18). 

30 EXAMPLE 6: 

DETERMINATION OF POLY AMIDE BINDING ORIENTATION 

Affinity cleavage experiments using hairpin polyamides modified with EDTA*Fe(II) at 
either the C-terminus or on the y-tum, were used to determine polyamide binding orientation 
35 and stoichiometry. The results of affinity cleavage experiments are consistent only with 
recognition of 6-bp by an 8-ring hairpin complex and rule out any extended 1:1 or overlapped 
complex formation. In addition, affinity cleavage experiments reveal hairpin formation 
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supporting the claim that it is the Hp/Py and Py/Hp pairing which form at both match and 
mismatch sites to discriminate A»T from T»A, 

Affinity cleavage reactions were executed in a total volume of 40 \iL, A stock solution of 
5 polyamide or H2O was added to a solution containing labeled restriction fragment (20,000 
cpm), affording final solution conditions of 25 mM Tris-Acetate, 20 mM NaCl, 100 jiM/bp calf 
thymus DNA, and pH 7.0. Solutions were incubated for a minimum of 4 hours at 22*^C. 
Subsequently, 4 ^iL of freshly prepared 100 Fe(NH4)2(S04)2 was added and the solution 
allowed to equilibrate for 20 min. Cleavage reactions were initiated by the addition of 4 |iL of 
10 100 mM dithiothreitol, allowed to proceed for 30 min at 22 **C, then stopped by the addition of 
10 ^L of a solution containing 1.5 M NaOAc (pH 5.5), 0.28 mg/mL glycogen, and 14 |iM base 
pairs calf thymus DNA, and ethanol precipitated. The reactions were resuspended in Ix 
TBE/80% formamide loading buffer, denatured by heating at 85 for 15 min, and placed on 
ice. The reaction products were separated by electrophoresis on an 8% polyacrylamide gel 
15 (5% cross-link, 7 M urea) in Ix TBE at 2000 V for 1.5 hours. Gels were dried and exposed to a 
storage phosphor screen. Relative cleavage intensities were determined by volume integration 
of individual cleavage bands using ImageQuant software. 
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EXAMPLE 7: 

IMPROVEMENT TO POLYAMIDE SEQUENCE SPECIFICITY. 



10 



15 



20 



The polyamides- of this invention provide improved specificity relative to existing 
polyamide technology. Turner, J. T., Baird, E. E., and Dervan, P.B. describe the recognition of 
seven base pair sequences in the minor groove of DNA by ten-ring pyrtole-imidazole 
polyamide hairpins 1 Am. Chem. Soc. 1997 JJ9, 7636. For example, quantitative DNasel 
footprint titrations reveal that the lO-ring hairpin ImPyPyPyPy.Y-ImPyPyPyPy.p.Dp binds a 5'- 
TGTAACA-3- sequence with an equlibrium dissociation constant of 0.083 nM, and 18-fold 
specificity versus a single base mismatch site. A number of other sites are also bound on the 
252-bp DNA firagment used for the footprint titration experiments. (Figure 13). Introduction of 
a Hp/Py and Py/Hp pair in the 10-ring polyamide, ImHpPyPyPy-y-ImHpPyPyPy.p-Dp, to 
recognize a T*A and A-T within the 7-bp target sequence, increases the sequence-specificty. For 
example, a single base mismatch site 5'-TGGAACA-3 is discriminated by > 5000-fold (Figure 
13, Table 8). In fact all 245 7-bp mismatch sites present on the restriction fi-agment are 
discriminated > 5000-fold by the polyamide ImHpPyPyPy-y.ImHpPyPyPy-p-E>p (Figure 13). 
For cases where three A,T base pairs are present in succession it is preferred to substitute Py/Py 
in place of at least one Hp/Py or Py/Hp to provide for recognition of A-T and T-A at a single 
position. 



TABLE 8 Equilibrium dissociation constants* 



Polyamidet 



5'-TGGTCA-3' 



5'-TGGACA-3' 



5'-T O 



Py/Py 



3'.A C 



#0OCCk 

•eKK>doc#^ 



C A-3' 5*-T 6 



+>0Ot 

3'-A Cf 



C A-3' 



18 



Q T-5' 



/rd= 1.5 nM 



5*-T O T a[a]c A-3' 
• ®OCCK 

Hp/Py ^H>O0O®#^ 

3*-A C [aJtItJo T-5' 

Ka = 02 nM 



5'-T O^T A C A-3* 

•^<><>PO®^^ >5000 
3'-A C [C]a[t]0 t-5* 

> 1000 nM 



*The reported dissociation constants are the average values obtained from three 
DNase 1 footprint titration experiments. The standard deviation for each data set is 
less than 15% cf the reported number. Assays were carried out in the presence of 10 
mM Tris-HQ, 10 mM KCl, 10 mM MgCl^ and 5 mM CaC12 at pH 7.0 and 22 'C. 
tRing pairing opposite !• A and A»T in the fourth position. 
♦Calculated as Xd(5'-TGGTACA-3')/Kjj(5'-TGTAACA-3'). 



25 EXAMPLE 8: 

USE OF PAIRING CODE 

There are 256 possible four base pair combinations of A, T, G, and C. Of these, there are 
a possible 240 four base pair sequences which contain at least 1 A^^ or T»A base pair and 
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therefore can advantageously use an Hp/Py, or Py/Hp carboxamide binding. Polyamides 
binding to any of these sequences can be designed in accordance with the code of TABLE 2. 
Table 9 lists the sixteen eight-ring hairpin polyamides (1-16) which recognize the sixteen 5'- 
WGTNNW-3' sequences (W = A or T, X = A, G, C. or T). Table 10 lists the sixteen eight-ring 
hairpin polyamides (17-32) which recognize the sixteen 5'-WGANNW-3' sequences (17-32). 
Table 1 1 lists the twelve eight-ring hairpin polyamides (33-44) which recognize twelve 5'- 
WGGNNW-3' sequences which contain at least one A,T base pair. Table 1 1 Usts the four eight- 
ring hairpin polyamides (G1-G4) which target the four 5'-WGGNNW-3' sequences {G1-G4) 
which contain exclusively G»C base pairs. Table 12 lists the twelve eight-ring hairpin 
polyamides (45-56) which recognize twelve 5'-WGCNNW-3* sequences which contain at least 
one A,T base pair. Table 12 lists the four eight-ring hairpin polyamides (G5-G8) which target 
the four 5'-WGCNNW-3' sequences (G5-G8) which contain exclusively G«C base pairs. Table 
13 lists the sixteen eight-ring hairpin polyamides (57-72) which recognize the sixteen 5'- 
WTTNNW-3' sequences (57-72). Table 14 lists the sixteen eight-ring hairpin polyamides (73- 
15 88) which recognize the sixteen 5'-WTANNW-3' sequences (73-88). Table 15 lists the sixteen 
eight-ring hairpin polyamides (89-104) which recognize the sixteen 5'-WTGNNW-3' sequences 
(89-104). Table 16 lists the sixteen eight-ring hairpin polyamides (105-120) which recognize 
the sixteen 5'-WTCNNW.3' sequences (105-120). Table 17 lists the sixteen eight-ring hairpin 
polyamides (121-136) which recognize the sixteen 5'-WATNNW.3' sequences (121-136). 
20 Table 1 8 lists the sixteen eight-ring hairpin polyamides (137-152) which recognize the sixteen 
5'-WAANNW-3' sequences (137-152). Table 19 lists the sixteen eight-ring hairpin polyamides 
(153-168) which recognize the sixteen 5'-WAGNNW-3' sequences (153-168). Table 20 lists 
the sixteen eight-ring hairpin polyamides (169-184) which recognize the sixteen 5'-WACNNW- 
3' sequences (169-184). Table 21 lists the sixteen eight-ring hairpin polyamides (185-200) 
25 which recognize the sixteen 5'-WCTNNW-3' sequences (185-200). Table 22 lists the sixteen 
eight-ring hairpin polyamides (201-216) which recognize the sixteen 5'-WCANNW-3' 
sequences (201-216). Table 23 lists the twelve eight-ring hairpin polyamides (217-228) which 
recognize the twelve 5'-WCGNNW-3' sequences which contain at least one A,T base pair. 
Table 23 lists the four eight-ring hairpin polyamides (G9-G12) which target the four 5'- 
30 WCGNNW-3' sequences (G9-G12) which contain exclusively C»G base pairs. Table 24 lists 
the twelve eight-ring hairpin polyamides (229-240) which recognize the twelve 5'-WCCNNW- 
3' sequences which contain at least one A,T base pair. Table 24 lists the four eight-ring hairpin 
polyamides (G13-G16) which target the four 5'-WCCNNW-3' sequences (G13-G16) which 
contain exclusively C*G base pairs. 
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TABLE 9: 8-ririg Hairpin Polyamidcs for recognition of 6-bp 5'-WGTNNWO' 
5 DNA sequence aromatic amino acid sequence 



1) 


5- 


-W 


G 


T 


T 


T 


W-3' 


1) ImHpHpHp-Y-PyPyPyPy 


2) 


5' 


-W 


0 


T 


T 


A 


W-3' 


2) ImHpHpPy-y-HpPyPyPy 


3) 


5* 


-w 


G 


T 


T 


G 


W-3' 


3) IrnHpHpIm-y-PyPyPyPy 


4) 


5» 


-w 


G 


T 


T 


C 


W-3' 


4) ImHpHpPy-Y-ImPyPyPy 


5) 


5* 


-w 


G 


T 


A 


T 


W-3' 


5) imHpPyHp-y-PyHpPyPy 


6) 


5' 


-w 


G 


T 


A 


A 


W-3' 


6) ImHpPyPy-y-HpHpPyPy 


7) 


5* 


-w 


G 


T 


A 


G 


W.3' 


7) ImHpPylm-y-PyHpPyPy 


8) 


5' 


-w 


G 


T 


A 


C 


W-3' 


8) ImHpPyPy-y-ImHpPyPy 


9) 


5' 


-w 


G 


T 


G 


T 


W-3 • 


9) ImHpImHp-y-PyPyPyPy 


1 0^ 
±\J } 






u 


T 


\3 


/\ 


n — J 




11) 


5' 


-w 


G 


T 


G 


G 


W-3' 


11) ImHpImlm-y-PyPyPyPy 


12) 


5 


-w 


G 


T 


G 


C 


W-3' 


12) ImHpImPy-y-ImPyPyPy 


13) 


5 


-w 


G 


T 


C 


T 


W-3' 


13) ItnHpPyHp-y-PylmPyPy 


14) 


5 


-w 


G 


T 


C 


A 


W-3' 


14) ImHpPyPy-y-HpImPyPy 


15) 


5 


•-W 


G 


T 


C 


G 


W-3' 


15) ItnHpPylm-y-PylmPyPy 


16) 


5 




G 


T 


C 


C 


W-3' 


16) ImHpPyPy-y-ImlmPyPy 
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TABLE 10: 8-ring Hairpin Polyamides for recognition of 6-bp S'-WGANNW-3* 
DNA sequence aromatic amino acid sequence 



5 


17) 


5' 


-W 


Q 


A 


T 


T 


W-3 ' 


17) ImPyHpHp-y-PyPyHpPy 




18) 


5' 


-w 


6 


A 


T 


A 


W-3» 


18) ImPyHpPy-y-HpPyHpPy 


10 


19) 


5' 


-w 


0 


A 


T 


G 


W-3 • 


19) ImPyHpIm-y-PyPyHpPy 




20) 


5' 


-w 


G 


A 


T 


C 


W-3' 


20) ImPyHpPy-y-ImPyHpPy 




21) 


5" 


-w 


0 


A 


A 


T 


W-3' 


21) ImPyPyHp-y-PyHpHpPy 


15 


22) 


5' 


-w 


6 


A 


A 


A 


W-3' 


22) ImPyPyPy-y-HpHpHpPy 




23) 


5< 


-w 


G 


A 


A 


G 


W-3' 


23) ImPyPylm-y-PyHpHpPy 


20 


24) 


5' 


-w 


G 


A 


A 


C 


W-3 • 


24) ImPyPyPy-y-ImHpHpPy 




25) 


5' 


-w 


G 


A 


G 


T 


W-3 ' 


25) ImPylmHp-y-PyPyHpPy 




26) 


5' 


-w 


G 


A 


G 


A 


W-3» 


26) ImPylmPy-y-HpPyHpPy 


25 


27) 


5' 


-w 


G 


A 


G 


6 


W-3' 


27) ImPylmlm-y-PyPyHpPy 




28) 


5 


-w 


G 


A 


6 


C 


W.3' 


28) ImPylmPy-y-ImPyHpPy 


30 


29) 


5 


-w 


G 


A 


C 


T 


W-3' 


29) ImPyPyHp-y-PylmHpPy 




30) 


5 


-w 


G 


A 


C 


A 


W-3» 


30) ImPyPyPy-y-HpImHpPy 




31) 


5 


-w 


G 


A 


C 


G 


W-3' 


31) ImPyPylm-y-PylmHpPy 


35 


32) 


5 


' -w 


G 


A 


C 


C 


W-3 ' 


32 ) ImPyPyPy-y - ImlmHpPy 
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TABLE 1 1 : 8-ring Hairpin Polyamides for recognition of 6-bp 5*-WGGNNW-3' 
DNA sequence aromatic amino acid sequence 



5 


33) 


5' 


-W 


G 


G 


T 


T 


W-3' 


33) 


ImlmHpHp - y- PyPyPyPy 




34) 


5' 


-W 


Q 


G 


T 


A 


W-3« 


34) 


ImlmHpPy-y-HpPyPyPy 


10 


35) 


5' 


-w 


G 


G 


T 


G 


W-3« 


35) 


ImlmHpIm-y-PyPyPyPy 




36) 


5' 


-w 


G 


G 


T 


C 


W-3' 


36) 


ImlmHpPy-y- ImPyPyPy 




37) 


5' 


-w 


G 


G 


A 


T 


W-3« 


37) 


ImlmPyHp -y - PyHpPyPy 


15 


38) 


5' 


-w 


G 


G 


A 


A 


W-3 • 


38) 


ImlmPyPy-y-HpHpPyPy 




39) 


5' 


-w 


G 


G 


A 


G 


W-3 ' 


39) 


ImlmPylm-y- PyHpPyPy 


20 


40) 


5' 


-w 


G 


G 


A 


C 


W-3' 


40) 


ImlmPyPy-y- ImHpPyPy 




41) 


5" 


-w 


G 


G 


G 


T 


W-3< 


41) 


ImlmlmHp -y- PyPyPyPy 




42) 


5' 


-w 


G 


G 


G 


A 


W-3' 


42) 


ImlmlmPy-y-HpPyPyPy 


25 


43) 


5* 


-w 


G 


G 


C 


T 


W-3' 


43) 


ImlmPyHp -y- PylmPyPy 




44) 


5' 


-w 


G 


G 


c 


A 


W-3' 


44) 


ImlmPyPy-y-HpImPyPy 


30 


Gl) 


5" 


-w 


G 


G 


G 


G 


W-3' 


Gl) 


Imlmlmlm-y- PyPyPyPy 




G2) 


5' 


-w 


G 


G 


G 


C 


W-3 ' 


G2] 


ImlmlmPy-y - ImPyPyPy 




G3) 


5' 


-w 


G 


G 


C 


G 


W-3' 


G3) 


ImlmPylm-y- PylmPyPy 


35 


G4) 


5' 




G 


G 


C 


C 


W-3 ' 


G41 


ImlmPyPy-y- ImlmPyPy 
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TABLE 12: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WGCNNW-3* 











DNA sequence 


aromatic amino acid sequence 


5 




45) 


5' 


-W G C T T 




45) ImPyHpHp-Y-PyPyimPy 






46) 


5' 


-W G C T A 


W-3» 


46) ImPyHpPy-y-HpPylmPy 






47) 


5 ' 


-W G C T G 


W-3 ' 


47) ImPvHnlm-v-PvPvImPv 




















48) 


5' 


-W G C T C 


W-3' 


48) ImPyHpPy-y-ImPylmPy 






49) 


5' 


-W G C A T 


W-3» 


4 9) ImPyPyHp-Y-PyHpImPy 


15 




50) 


5" 


-W G C A A 


W-3« 


50) ImPyPyPy-y-HpHpImPy 






51) 


5' 


-W G C A G 


W.3' 


51) ImPyPylm-y-PyHpImPy 






3« / 




-W G C A C 


W-3 ' 






















53) 


5 


'-W G C G T 


W-3' 


53) ImPylmHp-y-PyPylmPy 






54) 


5 


' -W G C G A 


W-3« 


54) ImPylmPy-y-HpPylmPy 








c 


• -W G C C T 


W-3' 


xiniry iryri^ " ]^ ~ iinxnitry 






56) 


5 


G C C A 


W-3' 


56) ImPyPyPy-y-HpImlmPy 






G5) 


5 


*-W G C G G 


W-3' 


G5) ImPyltnlm-y-PyPylmPy 


30 


















G6) 


5 


'-W G C G C 


W-3' 


G6) ImPylmPy-y-ImPylmPy 






G7) 


5 


O C C G 


W-3' 


G7) ImPyPylm-y-PylmlmPy 


35 




G8) 


5 


G C C C 


W-3' 


GB) ImPyPyPy-y-ImlmlmPy 
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DNA sequence 


5 


57) 


5 


' -W T T T T W-3 




58) 


5 


• -W T T T A W-3 




59) 


5 


T T T G W-3 


10 










60) 


5 


'-W T T T C W-3 




61) 


5 


•-W T T A T W-3 


15 


62) 


5 


• -W T T A A W-3 




63) 


5 


'-W T T A 0 W-3 




.64) 


5 


•-W T T A C W-3 












65) 


5 


•-W T T G T W-3 




66) 


5» 


-W T T G A W-3' 


25 


67) 


5» 


-W T T G G W-3» 




68) 


5' 


-W T T G C W-3' 




69) 


5' 


-W T T C T W-3' 


30 










70) 


5* 


-W T T C A W-3' 




71) 


5' 


-W T T C G W-3 • 


35 


72) 


5' 


-W T T C C W-3' 



aromatic amino acid sequence 



57) HpHpHpHp-Y-PyPyPyPy 

58 ) HpHpHpPy-y-HpPyPyPy 

5 9 ) HpHpHpIm-y- PyPyPyPy 

60 ) HpHpHpPy-Y - ImPyPyPy 

6 1 ) HpHpPyHp - y- PyHpPy Py 

62 ) HpHpPyPy-y-HpHpPyPy 

63 ) HpHpPylm-y-PyHpPyPy 

64 ) HpHpPyPy-y- ImHpPyPy 

65 ) HpHpImHp-y- PyPyPyPy 

66 ) HpHpImPy-y-HpPyPyPy 

67 ) HpHp I m I m-y- PyPyPyPy 

68) HpHpImPy-Y-ImPyPyPy 

6 9 ) HpHpPyHp -y - Py ImPy Py 

70) HpHpPyPy-y-HpImPyPy 

7 1 ) HpHpPy Itn-y - Py ImPyPy 

7 2 ) HpHpPyPy-y - ImlmPyPy 
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TABLE 14: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WTANNW-3' 



DNA sequence aromatic amino acid sequence 



5 


73) 


5' 


-W 


T A 


T 


T 


W-3« 


73)HpPyHpHp-Y-PyPyHpPy 




74) 


5* 


-W 


T 71 


T 


A 


W-3' 


74 ) HpPyHpPy-y-HpPyHpPy 




75) 


5« 


-W 


T A 


T 


G 


W-3 ' 


75 ) HpPyHpIm-y-PyPyHpPy 


10 




















76) 


5' 


-w 


T A 


T 


C 


W-3' 


76 ) HpPyHpPy-y-ImPyHpPy 




77) 


5' 


-w 


T A 


A 


T 


W-3' 


77) HpPyPyHp-Y-PyHpHpPy 


15 


78) 


5« 


-w 


T A 


A 


A 


W.3' 


7 8 ) HpPyPyPy-Y-HpHpHpPy 




79) 


5' 


-w 


T A 


A 


G 


W-3« 


79 ) HpPyPylm-Y - PyHpHpPy 




80) 


5' 


-w 


T A 


A 


C 


W-3 ' 


80 ) HpPyPyPy-y- ItnHpHpPy 


20 




















81) 


5' 


-w 


T A 


G 


T 


W.3' 


8 1) HpPyimHp-Y- PyPyHpPy 




82) 


5' 


-w 


T A 


Q 


A 


W-3» 


82 ) HpPylmPy-Y-HpPyHpPy 


25 


83) 


5 


-w 


T A 


6 


G 


W-3' 


83 ) HpPylmlm-Y- PyPyHpPy 




84) 


5 


' -w 


T A 


6 


C 


W.3' 


84 ) HpPylmPy-Y - ImPyHpPy 




85) 


5 


■-W 


T A 


C 


T 


W.3' 


8 5 ) HpPy PyHp - y - Py ImHpPy 


30 




















86) 


5 




T A 


C 


A 


W.3' 


8 6 ) HpPy Py Py - y - Hp ImHpPy 




87) 


5 




T A 


C 


G 


W-3' 


87) HpPyPylm-Y-PylmHpPy 


35 


88) 


5 




T A 


C 


C 


W-3' 


88) HpPyPyPy-y- ImlmHpPy 
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TABLE 15: S-ring Hairpin Polyamides for recognition of 6-bp 5'-WTGNNW-3' 



DNA sequence aromatic amino acid sequence 



5 


89) 


5' 


-W 


T 


0 


T 


T 


W-3' 


69) 


HpImHpHp-y- PyPyPyPy 




90) 


5* 


-w 


t' 


G 


T 


A 


W-3' 


90) 


Hp IraHp Py - y - HpPy Py Py 




91) 


5* 


-w 


T 


G 


T 


G 


W-3' 


91) 


Hp IraHp I m - y - Py Py Py Py 


10 
























92) 


5' 


-w 


T 


G 


T 


C 


W-3" 


92) 


HpIinHpPy-y- ImPyPyPy 




93) 


5» 


-w 


T 


G 


A 


T 


W-3» 


93) 


HpImPyHp -y- PyHpPyPy 


15 


94) 


5' 


-w 


T 


G 


A 


A 


W-3' 


94) 


Hp ImPy Py -y - HpHpPy Py 




95) 


5» 


-w 


T 


G 


A 


G 


W-3' 


95) 


HpImPylm-y- PyHpPyPy 




96) 


5' 


-w 


T 


G 


A 


C 


W-3' 


96) 


HpImPyPy-y- ImHpPyPy 


























97) 


5' 


-w 


T 


G 


G 


T 


W-3' 


97) 


Hp ImlmHp - Y - Py PyPy Py 




98) 


5" 


-w 


T 


G 


G 


A 


W-3' 


98} 


HpImltnPy-y-HpPyPyPy 


25 


99) 


5' 


-w 


T 


G 


C 


T 


W-3» 


99) 


HpImPyHp-y-PylmPyPy 




100) 


5 


-w 


T 


G 


C 


A 


W-3' 


100] 


Hp ImPy Py - y - Hp I mPy Py 




101) 


5 


' -w 


T 


G 


G 


G 


W-3' 


101 


Hp Imlmlm-y - Py Py PyPy 


30 
























102) 


5 


' -w 


T 


G 


G 


C 


W-3' 


102 


HpImlmPy-y- ImPy PyPy 




103) 


5 


• -w 


T 


G 


C 


G 


W-3' 


103) HpImPylm-y-PylmPyPy 


35 


104) 


5 


• -w 


T 


G 


C 


C 


W-3' 


104 


1 HpImPyPy-y-ImlmPyPy 
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TABLE 16: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WTCNNW-3' 
DNA sequence aromatic amino acid sequence 



5 


105) 


5' 


-W 


T 


C 


T 


T 


W-3' 


105 ) HpPyHpHp-y- PyPylmPy 




106) 


5> 


-w 


T 


C 


T 


A 


W-3» 


106) HpPyHpPy-y-HpPylmPy 


in 


107) 


5« 


-w 


T 


C 


T 


G 


W-3» 


1 0 7 ) HpPyHp Im - Y - Py Py ImPy 




108) 


5" 


-w 


T 


C 


T 


C 


W-3' 


108)HpPyHpPy-y-ImPyImPy 




109) 


5' 


-w 


T 


C 


A 


T 


W.3' 


109) HpPyPyHp-Y-PyHpImPy 


15 


110) 


5' 


-w 


T 


c 


A 


A 


W.3« 


110 ) HpPyPyPy-y-HpHpImPy 




111) 


5« 


-w 


T 


c 


A 


G 


W-3' 


111) HpPyPy Im -y - PyHpImPy 




112) 


5' 


-w 


T 


c 


A 


C 


W-3' 


1 1 2 ) Hp Py Py Py - y - ImHp ImPy 




113) 


5' 


-w 


T 


c 


G 


T 


W-3' 


113 ) HpPylmHp-y- PyPylmPy 




114) 


5 


-w 


T 


c 


6 


A 


W-3' 


114 ) HpPylmPy-y-HpPylmPy 


25 


115) 


5 


-w 


T 


c 


C 


T 


W-3' 


115) HpPyPyHp-y-PylmlmPy 




116) 


5 




T 


c 


C 


A 


W-3' 


116) HpPyPyPy-y-HpImlmPy 


30 


117) 


5 


' -w 


T 


c 


G 


G 


W-3' 


117)HpPyImIm-y-PyPyImPy 




118) 


5 


»-w 


T 


c 


G 


C 


W.3' 


1 1 8 ) Hp Py I mPy -y - ImPy ImPy 




119) 


5 


• -w 


T 


c 


C 


G 


W-3' 


119) HpPyPylm-y-PylmlmPy 


35 


120) 


5 


' -w 


T 


c 


C 


C 


W-3 ' 


120) HpPyPyPy-y-ImlmlmPy 
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TABLE 17: 8-ring Hairpin Polyamides for recognition of 6-bp 5*-WAI>fNW-3' 
5 DNA sequence aromatic amino acid sequence 



121) 


5 ' 






T 


T 


T 


W-3 * 


X4 X / r'ynpxipnp - y - try try tip 


122) 


5' 


-w 


A 


T 


T 


A 


W-3 ' 


122) PvHDHDPv-v-HnPvPvHn 


123) 


5' 


-w 


A 


T 


T 


G 


W-3 ' 


1231 PvHoHnlm-v-PvPvPvHn 

J f f jn^4* Jejuni J tryirytrYrx^ 


124) 


5' 


-H 


A 


T 


T 


c 


W-3 • 


124) PvHnHnPv-v- TmPvPvHn 


125) 


5' 


-W 


A 


T 


A 


T 


W-3 ■ 


12 5) PvPnPvHn-v-PvrWrtDvH« 


126) 


5 ' 


-W 


A 


T 


A 


A 


Wr3 ' 


T^fi) P\/WnDvD\7-v-U«Ur\P\rIIrs 
X^OV trynpiryiry-y-ripripr^iip 


127) 


5 ■ 


-w 




T 


A 


Q 


W-3 • 


' / irynpr'yxin-y- t^ripiryrip 


128) 


5 ' 


-W 


A 


T 


A 


Q 


W-3 ' 


x^oj irynpryiry-y- inuipiryrip 


129) 


5 ' 


-W 


A 


T 




T 


Fl — J 


x^^j t'ynpxmnp -y- FyFyirynp 


130) 


5' 


-w 


A 


T 


G 


A 


W-3' 


130) PyHpImPy-y-HpPyPyHp 


131) 


5" 


-w 


A 


T 


6 


G 


W-3» 


131) PyHpImlm-y-PyPyPyHp 


132) 


5 


-w 


A 


T 


G 


C 


W-3' 


132) PyHpImPy-y-ImPyPyHp 


133) 


5 


-w 


A 


T 


C 


T 


W.3 ' 


133) PyHpPyHp-y-PylmPyHp 


134) 


5 


-w 


A 


T 


C 


A 


W-3' 


134) PyHpPyPy-y-HpImPyHp 


135) 


5 


-w 


A 


T 


C 


G 


W-3 ' 


135) PyHpPylm-y-PylmPyHp 


136) 


5 


-w 


A 


T 


C 


C 


W-3' 


136) PyHpPyPy-y-ImltnPyHp 
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TABLE 18: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WAAhfNW-3' 
DNA sequence aromatic amino acid sequence 



5 


137) 


5" 




A 


A 


T 


T 


W-3' 


137) PyPyHpHp-y-PyPyHpHp 




138) 


5« 


-W 


A 


A 


T 


A 


W.3» 


138) PyPyHpPy-Y-HpPyHpHp 


10 


139) 


5« 


-W 


A 


A 


T 


G 


W-3' 


139) PyPyHpIm-y-PyPyHpHp 


140) 


5' 


-W 


A 


A 


T 


C 


W-3' 


14 0 ) PyPyHpPy-y- ImPyHpHp 




141) 


5' 


-w 


A 


A 


A 


T 


W"3< 


141) PyPyPyHp-Y-PyHpHpHp 


15 


142} 


5' 


-w 


A 


A 


A 


A 


W-3< 


142) PyPyPyPy-y-HpHpHpHp 




143) 


5' 


-w 


A 


A 


A 


G 


W-3' 


143 ) PyPyPylm-y-PyHpHpHp 


20 


144) 


5' 


-w 


A 


A 


A 


C 


W-3' 


144) PyPyPyPy-y-ImHpHpHp 




145) 


5' 


-w 


A 


A 


6 


T 


W-3' 


14 5 ) PyPy ImHp -y- PyPyHpHp 




146) 


5' 


-w 


A 


A 


G 


A 


W-3' 


146) PyPylmPy-y-HpPyHpHp 


25 


147) 


5' 


-w 


A 


A 


G 


G 


W-3' 


147) PyPyImIm-y-?yPyHpHp 




148) 


5' 


-w 


A 


A 


G 


C 


W.3' 


148 ) PyPylmPy-y- ImPyHpHp 


30 


149) 


5 


-w 


A 


A 


C 


T 


W-3' 


149) PyPyPyHp-y-PylmHpHp 




150) 


5 


-w 


A 


A 


C 


A 


W-3' 


150) PyPyPyPy-y-HpImHpHp 




151) 


5 


-w 


A 


A 


C 


G 


W-3' 


151) PyPyPylm-y-PylmHpHp 


35 


152) 


5 




A 


A 


C 


C 


W-3' 


152 ) PyPyPyPy-y- ImlmHpHp 
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TABLE 19: 8-ring Hairpin Polyamides for recognition of 6-bp S'-WAGNNW-3' 









DNA sequence 


aromatic amino acid sequence 


5 


153) 


5 


A G T T W-3 ' 


153 ) PylmHpHp-y-PyPyPyHp 




154) 


5 


A G T A W-3« 


154) PyltnHpPy-y-HpPyPyHp 


10 


155) 


5 


'-W A G T G W-3 > 


155) PylmHpIm-y-PyPyPyHp 


156) 


5 


• -W A G T C W-3 • 


156 ) PylmHpPy-y- ImPyPyHp 




157) 


5 


• -W A G A T W-3 • 


157 ) PylmPyHp-y-PyHpPyHp 


15 


158) 


5 


'-W A G A A W-3' 


158 ) PylmPyPy-y-HpHpPyHp 




159) 


5 


'-W A G A G W-3' 


15 9 ) PylmPy Im-y- PyHpPyHp 


20 


160) 


5 


»-W A G A C W-3' 


160 ) Py ImPyPy-y- ImHpPyHp 




161) 


5 


'-W A G G T W-3' 


161) PylmlmHp-y-PyPyPyHp 




162) 


5 


•-W A G G A W-3' 


1 6 2 ) Py Im ImPy - y - Hp Py PyHp 


25 


163) 


5 


' -W A G C T W-3 ' 


1 6 3 ) Py ImPyHp -y- Py ImPyHp 




164) 


5' 


-W A G C A W-3 • 


164) PylmPyPy-y- Hp ImPyHp 


30 


165) 


5' 


-W A G G G W-3' 


165) Pylmlmlm-y-PyPy PyHp 




166) 


5' 


-W A G G C W-3 ' 


166) E>y^^I^Py"Y"I"^PyPyKp 




167) 


5' 


-W A G C G W-3' 


167) PylmPylm-y-Py ImPyHp 


35 


168) 


5' 


-W A G C C W-3' 


168) PylmPyPy-y-ImlmPyHp 
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TABLE 20 



8-ring Hairpin Polyamides for recognition of 6'bp S'-WACNNW-3' 



DNA sequence 



aromatic amino acid sequence 



5 


169) 


5' 


-W A 


C 


T 


T 


W-3' 


169) PyPyHpHp-Y-PyPylmHp 




170) 


5» 


-W A 


c 


T 


A 


W-3' 


170) PyPyHpPy-y-HpPylmHp 




171) 


5' 


-W A 


c 


T 


0 


W-3' 


171) PyPyHpIm-Y-PyPylmHp 


10 




















172) 


5» 


-W A 


c 


T 


C 


W-3» 


172 ) PyPyHpPy-y- ImPylmHp 




173) 


5' 


-W A 


c 


A 


T 


W-3' 


173) PyPyPyHp-Y-PyHpImHp 


15 


174) 


5' 


-W A 


c 


A 


A 


W-3' 


174) PyPyPyPy-y-HpHpImHp 




175) 


5' 


-W A 


c 


A 


G 


W-3' 


175) PyPyPylm-y-PyHpIinHp 




176) 


5* 


-W A 


c 


A 


C 


W-3' 


176) PyPyPyPy-y-ImHpImHp 


zu 




















177) 


5' 


-W A 


c 


G 


T 


W-3' 


177) PyPylrnHp-y-PyPylinHp 




178) 


5' 


-W A 


c 


0 


A 


W-3' 


178) PyPylmPy-y-HpPylmHp 


25 


179) 


5' 


-W A 


c 


C 


T 


W-3' 


179) PyPyPyHp-y-PylmlmHp 




180) 


5 


-W A 


c 


c 


A 


W-3' 


180) PyPyPyPy-y-HpImlmHp 




181) 


5 


' -W A 


c 


G 


G 


W-3' 


181) PyPylmlin-y-PyPylmHp 


30 




















182) 


5 


' -W A 


c 


G 


C 


W-3' 


1 8 2 ) Py Py I tnPy - y - 1 mPy ImHp 




183) 


5 


' -W A 


c 


C 


G 


W-3' 


183) PyPyPylm-y-PylmlmHp 


35 


184) 


5 


' -W A 


c 


C 


C 


W-3' 


184 ) PyPyPyPy-y- ImlmlmHp 
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TABLE 21: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WCTNNW'3^ 



DNA sequence aromatic amino acid sequence 



5 


185) 


5« 


-W 


c 


T 


T 


T 


W-3' 


185) PyHpHpHp-Y-PyPyPylm 




186) 


5 ' 


-W 




T 


T 


A 








187) 


5» 


-W 


c 


T 


T 


Q 


W-3» 


187) PyHpHpIm-y-PyPyPylm 


10 






















188) 


5' 


-w 


c 


T 


T 


C 


W-3« 


188) PyHpHpPy-Y-ImPyPylm 




189) 


5« 


-w 


c 


T 


A 


T 


W-3' 


189) PyHpPyHp-y-PyHpPylm 


15 


190) 


5' 


-w 


c 


T 


A 


A 


W-3' 


190) PyHpPyPy-y-HpHpPylm 












rp 
X 


A 


n 




191; pyHppyim-y-pyHppylin 




192) 


5' 


-w 


c 


T 


A 


C 


W-3» 


192) PyHpPyPy-y-ImHpPylm 


20 






















193) 


5' 


-w 


c 


T 


G 


T 


W-3* 


193 ) PyHpImHp-y-PyPyPylm 






i; 1 

3 


TaT 

fi 


r* 
\^ 


T 
1 




A 


Vf ~ J 


194) PyHpIitiPy-y-HpPyPylm 


25 


195) 


5" 


-W 


c 


T 


0 


G 


W-3» 


195) PyHpImlm-y-PyPyPylm 




196) 


5 


-w 


c 


T 


G 


C 


W-3' 


196) PyHpImPy-y-ImPyPylm 




197) 


5 


-w 


c 


T 


C 


T 


W-3« 


197) PyHpPyHp -y- PyltnPylm 


30 






















198) 


5 


• -w 


c 


T 


C 


A 


W-3' 


198) PyHpPyPy-y-HpImPylm 




199) 


5 


' -w 


c 


T 


C 


G 


W-3' 


199) PyHpPylm-y-PylmPylm 


35 


200) 


5 


» -w 


c 


T 


c 


C 


W-3' 


200) PyHpPyPy-y-ImlmPylm 
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TABLE 22: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WCANNW-3' 



DNA sequence aromatic amino acid sequence 



5 


201) 


5' 




c 


A 


T 


T 


W-3» 


201) PyPyHpHp-y-PyPyHpIm 




202) 


5' 


-w 


c 


A 


T 


A 


W-3' 


202) PyPyHpPy-y-HpPyHpIm 


10 


AVO / 








A 


T 


O 


W-3 ' 


2 03) PyPyHpIm-y-PyPyHpIm 




204) 


5' 


-W 


c 


A 


T 


c 


W-3' 


204) PyPyHpPy-y-ImPyHpIm 




205) 


5' 


-w 


c 


A 


A 


T 


W.3' 


205) PyPyPyHp-y-PyHpHpIm 


15 


206) 


5' 


-w 


c 


A 


A 


A 


W-3' 


206) Py Py Py Py - y - HpHpHp Im 




207) 


5' 


-w 


c 


A 


A 


G 


W-3 ' 


207) PyPyPyltn-y-PyHpHpIm 


20 


^ w O / 




n 


r* 

V. 


A 


A 


k« 


IU_ ^ 1 
W- J 


2 08) PyPyPyPy-y-ImHpHpIm 




209) 


5' 


-W 


c 


A 


G 


T 


W-3' 


2 09) PyPylmHp-y-PyPyHpIm 




210) 


5' 


-W 


c 


A 


G 


A 


W-3' 


210) PyPylmPy-y-HpPyHpIm 


25 


211) 


5 ' 


-w 


c 


A 


G 


G 


W-3' 


211) PyPyltnlm-y-PyPyHpIm 




212) 


5' 


-w 


c 


A 


G 


C 


W-3 ' 


212) PyPylmPy-y - ImPyHpIm 


30 


213) 


5" 


-w 


c 


A 


C 


T 


W-3' 


213) PyPyPyHp-y-PylmHpIm 




214) 


5' 


-w 


c 


A 


C 


A 


W-3' 


214 ) PyPyPyPy-y-HpImHpItn 




215) 


5' 


-w 


c 


A 


c 


G 


W.3' 


215) PyPyPylm-y-PylmHpIm 


35 


216) 


5' 


-w 


c 


A 


c 


C 


W-3' 


216) PyPyPyPy-y-ImlmHpIm 
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TABLE 23: 8-rmg Hairpin Polyamides for recognition of 6-bp S'-WCGNNW-3' 
DNA sequence aromatic amino acid sequence 



5 


217) 


5 


C 


G 


T 


T W-3' 


217) PylmHpHp-y-PyPyPylm 




218) 


5 


' -W C 


G 


T 


A W-3' 


218) PylmHpPy-y-HpPyPylm 


10 


219) 


5 


C 


G 


T 


G W-3' 


219) PylmHpIm-Y-PyPyPylm 


220) 


5 


c 


G 


T 


C W-3' 


220) PylmHpPy-y-ImPyPylm 




221) 


5 


c 


G 


A 


T W-3' 


221) PylmPyHp-Y-PyHpPylm 


15 


222) 


5 


c 


G 


A 


A W-3' 


222 ) PylmPyPy-Y-HpHpPylm 




223) 


5 


c 


G 


A 


G W-3' 


223) PylmPylm-y-PyHpPylm 


20 


224) 


5 


c 


G 


A 


C W-3' 


224 ) PylmPyPy-y- ImHpPylm 




225) 


5 


c 


G 


G 


T W-3' 


225) PylmlmHp-y-PyPyPylm 




226) 


5 


c 


G 


G 


A W-3' 


22$) PylmlmPy-y-HpPyPylm 


25 


227) 


5 


»-w c 


G 


C 


T W-3' 


227) PylmPyHp-y-PylmPylm 




228) 


5' 


-W C G C A W-3" 


228) PylmPyPy-y-HpImPylm 


30 


09) 


5' 


-H C G 6 G W-3* 


G9) Pylmlmlm-y-PyPyPylm 




610) 


5' 


-W C G G C W-3' 


GIO) PyImImE>y-Y-ImPyPyIm 




Gil) 


5' 


-W C G C G W-3» 


Gil) PylmPylm-y-PylmPylm 


35 


G12) 


5' 


-W C G C C W-3' 


012 ) PylmPyPy-y- ImlmPylm 
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TABLE 24: 8-ring Hairpin Polyamides for recognition of 6-bp 5*-WCCNNW-3' 



DNA sequence aromatic an^no acid sequence 



c 

J 


Z ^ 7 / 


ti I 
3 


.u 

"W 






1 


T 


M_ 1 1 
W- J ' 


229) PyPyHpHp-y-PyPylmlm 














rr% 
X 


A 




230) PyPyHpPy-y-HpPylmlm 


10 


231) 


5' 


-W 


C 


c 


T 


G 


W-3» 


231) PyPyHpIm-y-PyPylmlm 




O 9 o \ 


c ■ 


-W 




c 


T 


c 


w-3 ' 


232 ) PyPyHpPy-y- ImPylmlm 




233; 


5 ' 


-W 


C 


c 


A 


T 


W-3 ' 


2 3 3 ) Py Py PyHp - y - PyHp Iml m 


15 


234) 


5 ' 


-w 


c 


c 


A 


A 


W-3 * 


234) PyPyPyPy-y-HpHpImlm 




235) 


5 ' 


-W 


c 


c 


A 


G 


W-3 ' 


235) PyPyPylm-y-PyHpItnIm 


20 


236) 


5' 


-w 


c 


c 


A 


C 


W-3* 


236) PyPyPyPy-y-ImHpImlra 




237) 


5 ' 


-W 


c 


c 


0 


T 


W-3 ' 


237) PyPylmHp-y-PyPylmltn 




238) 


5 ' 


-w 


c 


c 


0 


A 


W-3 ' 


238) PyPylmPy-y-HpPylmlm 


25 


239) 


5' 


-w 


c 


c 


c 


T 


W-3' 


239) PyPyPyHp-y-Pylmlmlm 




240) 


5' 


-w 


c 


c 


c 


A 


W-3» 


240) PyPyPyPy-y-HpImlmlm 


30 


013) 


5 




c 


c 


0 


0 


W.3' 


G13) PyPylmlm-y-PyPylmlm 




014) 


5 


'-W 


c 


c 


0 


c 


W-3' 


G14 ) PyPylmPy-y- ImPylmlm 




015) 


5 




c 


c 


c 


0 


W-3' 


G15) PyPyPylm-y-Pylmlmlm 


35 


016) 


5 


• -w 


c 


c 


c 


c 


W.3' 


G16) PyPyPyPy-y-Imlmlmlm 



EXAMPLE 9: 

Aliphatic/Aromatic amino acid pairing for recognition of the DNA minor groove. 

40 

Selective placement of an aliphatic p-alanine (P) residue paired side-by-side with either 
a pyrrole (Py) or imidazole (Im) aromatic amino acid is found to compensate for sequence 
composition effects for recognition of the minor groove of DNA by hairpin pyrrole-imidazole 
polyamides. A series of polyamides were prepared which contain pyrrole and imidazole 
45 aromatic amino acids, as well as y-aminobutyric acid (y) "turn" and P-alanine "spring" aliphatic 
amino acid residues. The binding affinities and specificities of these polyamides are regulated 
by the placement of paired p/p Py/p and Im/p residues. Quantitative footprint titrations 
demonstrate that replacing two Py/Py pairings in a 12-ring hairpin (6-y-6) with two Py/p 
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pairings affords 1 0-fold enhanced affinity and similar sequence specificity for an 8-bp target 
sequence. 

Table 25 Equilibrium association constflirts (M' * ) for polyamkto.*^ 



Potyamide 3' 


•TOTTAACAO' 


5-TCTOAACA-3' 


Specificity*' 


•<KXXXX 

»-o-ooooo^ 


2.5 K tO« 


3.9 X 10" 


6 




1.3 X 10' 


2.0xl0» 


7 


•-OOOOCK 


1.7 n lO** 


2.7 X 10" 


6 




Ox 10" 


2.2 X 10' 


55 


•-0-OOO-CK 
9-<KXH>OOV 


6.6 X 10' 


2.5 X 10" 


26 


•-0O0-CKX 


4.5 X 10" 


7.7 X 10» 


6 


•OCKXKX 


2.7 X 10" 


5.7 X 10' 


5 




£ 1 X to* 


s 1 X 10* 


1 



^Values reported are the mean values obtained from three ONase I 
footprint titration experiments. ^The assays wcie carried out ai 22 at 
pH 7.0 in the presence of 10 mM Tris-HQ, 10 mM KQ. 10 mM MgClj. 
and 5 mM CaQv ^ Match site association constants and specificities 
higher than the parent hairpin arc shown in boldtypc. ^Specificity is 
calculated as match) / mismatch). 

The 6-Y-6 hairpin ImPylmPyPyPy-y-ImPyPyPyPyPy-P-Dp, which contains six 
consecutive amino acid pairings, is unable to discriminate a single-base-pair mismatch site 5'- 
TGTTAACA-3' from a 5'-TGTGAACA-3' match site. The hairpin polyamide Im-P- 
ImPyPyPy-y-ImPyPyPy-P-Py-p-Dp binds to the 8-bp match sequence 5'-TGTGAACA-3' with 
an equilibrium association constant of Ka = 2.4 x 10^^ M"' and > 48-fold specificity versus the 
5'-TGTTAACA-3' single-base-pair mismatch site. 
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Table 26 Equilibrium association constants {M'h for poiyamides.*'-^ 



Polyamide 


y-TOTTAACAO* 


3'-TGTGAACA-3' 


Specificity*' 


•OOOOCX 

»-<><><>o-c>oV 


2 J K 10« 


3-9 X 10' 


6 


•-O-OOCKX 


6.6x10* 


2.5 X 10" 


26 


«-CK>0<>CK>#^ 




5x10* 


I 


$H><><)-0<X)V 


s5x 10' 


2.4 X 10>» 


^48 



^ Values rqxurted for 1. 5. and 10 are the mean values obtained from 
three DNase 1 footprint titration experiments. '^The assays were carried 
out at 22 at pH 7.0 in the presence of 10 mM Tris-HO. 10 mM KQ, 
10 mM MgQi* and 5 mM CaClz. ^ Match site association constants 
and specificities higher than parent hairpins are shown in 
boldtype. ''Specificity is calculated as K^^Cmatch^ / KgCmismatch). 

Modeling indicates that the P-alanine residue relaxes ligand curvature, providing for 
optimal hydrogen bond formation between the floor of the minor groove and both Im-residues 
v^^ithin the Im-P-Im polyamide subunit. This observation provided the basis for design of a 
hairpin polyamide, Im-P-ImPy-y-Im-P-ImPy-P-Dp, which incorporates Im/p pairings to 
recognize a "problematic" 5'-GCGC-3' sequence at subnanomolar concentrations. 



Table 27 Equilibrium association constamts (M* * ) for polyamides.^'^ 

Polyamide 5'-TGCGCA-3' 5V-TGGCCA-3' 5*-TGGGGA-3' 





< 10^ 


< 10^ 


^^>oi^ 3.7x10^ 


1.4 X 10^ 


1.1 X 10^ 



Values reported arc the mean values obtzained from a minimum of three 
DNase I footprint titration experiments. '^The assays were carried out at 
22 *C at pH 7.0 in the presence of 10 mM! Tris-HCl, 10 mM KCl, 10 mM 
MgCl2, and 5 mM CaClz- 



These results identify Im/p and p/Im pairings that respectively discriminate G»C and 
C»G from A»T/T*A as well as Py/p and p/Py pairings that discriminate A»T/T»A from 
G»C/C«G. These aliphatic/aromatic amino acid pairings will facilitate the design of hairpin 
polyamides which recognize both a larger binding site size as well as a more diverse sequence 
repertoire. 

EXAMPLE 10: 
POLYAMIDE BIOTIN CONJUGATES 

Bifunctional conjugates prepared between sequence specific DNA binding polyamides 
and biotin are useful for a variety of applications. First, such compounds can be readily attached 
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to a variety of matrices through the strong interaction of biotin with the protein streptavidin. 
Readily available strepdavidin-derivatized matrices include magnetic beads for separations as 
well as resins for chromatography. 

A niunber of such polyamide-biotin conjugates have been synthesized by solid phase 
synthetic methods outlined in detail above. Following resin cleavage with a variety of diamines, 
the polyamides were reacted with various biotin carboxylic acid derivatives to yield 
bifunctional conjugates. The bifunctional conjugates were purified by HPLC and characterized 
by MALDI-TOF mass spectroscopy and *H NMR. 

The scheme for the synthesis of an exemplary biotin-polyamide conjugate is shown 

below. 

p-Py-Py-Py-lm-Y-Py-Py-Py-Inn 

Resin 




o 




The foregoing is intended to be illustrative of the present invention, but not limiting. 
Numerous variations and modifications of the present invention may be effected without 
departing firom the true spirit and scope of the invention. 
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What is claimed is: 

1 . In a polyamide having at least three consecutive carboxamide pairs for 
binding to at least three DNA base pabs in the minor groove of a duplex 
DNA sequence having at least one A^T or T*A DNA base pair, the 
improvement comprising selecting a Hp/Py carboxamide pair to 
correspond to a T«A base pair in the minor groove of the duplex DNA 
sequence or selecting a Py/Hp carboxamide pair to bind to an A*T DNA 
base pair in the minor groove of the duplex DNA sequence. 

2. The polyamide of claim 1 wherein at least four consecutive carboxamide 
pairs bind to at least four DNA base pairs. 

3. The polyamide of claim 1 wherein at least five consecutive carboxamide 
pairs bind to at least five DNA base pairs. 

4. The polyamide of claim 1 wherein at least six consecutive carboxamide 
pairs bind to at least six DNA base pairs. 

5. The polyamide of claim 1 wherein the A«T or T»A base pair has a G»C 
or C«G base pair on either side. 

6. The polyamide of claim 1 wherein the duplex DNA sequence is a 
regulatory sequence. 

7. The polyamide of claim 1 wherein the duplex DNA sequence is a 
promoter sequence. 

8. The polyamide of claim 1 wherein the duplex DNA sequence is a coding 
sequence. 

9. The polyamide of claim 1 wherein the duplex DNA sequence is a non- 
coding sequence. 

10. The polyamide of claim 1 wherein the binding of the carboxamide pairs 
to the DNA base pairs modulates the expression of a gene. 
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11. A composition comprising an effective amount of the polyamide of claim 
1 and a pharmologically suitable excipient. 

1 2. A diagnostic kit comprising the polyamide of claim 1 . 

13. A polyamide according to claim 1 having the formula: 

X1X2X3X4-Y-X5X6X7X8 
wherein y is -NH-CH2-CH2-CH2-CONH- hairpin linkage derived from 
y-aminobutyric acid or a chiral hairpin linkage derived from R-2,4- 
diaminobutyric acid; X4/X5, X3/X6. X2/X7. and Xi/Xg represent 
carboxamide binding pairs which bind the DNA base pairs wherein at 
least one binding pair is Hp/Py or Py/Hp and the other binding pairs are 
selected from Py/Im Im/Py to correspond to the DNA base pair in the 
minor groove to be bound. 

14. The polyamide of claim 13 wherein there is at least one p-alanine in a 
non- Hp containing binding pair. 

15. The polyamide of claim 13 wherein dimethylaminopropylamide is 
covalently bound to Xi or Xg. 

16. A polyamide selected from those Hsted in Tables 9-24 as 
compounds 1 through 240. 

17. A polyamide selected from shown in Fig. 4. 
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0 
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2/17 



5*-T O 0(11 C ^-3* 5*.T O Offlc A.3* 

3'-A C Cgjo 3'-A C C^Q T-S' 

Py/Py with T»A Py/Py with A»T 



5*-T o gRF]c 

3'-A C C T-5* 
Py/Hp with T«A 



5'-T O GpHc A*3' 
3*-A C C aJo T-5' 

Py/Hp with A«T 



5*-T G O^C A-3' 5'-T G C [aI C A-3* 

+>OOO|0l^^ +»ooo#^ 

3*-A C cIaJG T-5* 3*-A C C [tJ C X-5* 

Hp/Py with T»A Hp/Py with A»T 
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3/17 
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